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IDENTITY-CODED SURFACE WITH REFERENCE POINTS 

FIELD OF INVENTION 

The present invention relates generally to a method of defining a region in 

relation to a surface, and to a region so defined. 
5 The invention has been developed primarily to provide the basis for a surface- 

based interface which allows a user to input and interact with information via a network, 
and to obtain interactive printed matter on demand via high-speed networked color 
printers. Although the invention will largely be described herein with reference to this 
use, it will be appreciated that the invention is not limited to use in this field. 



10 CO-PENDING APPLICATIONS 

Various methods, systems and apparatus relating to the present invention are 

disclosed in the following co-pending applications filed by the applicant or assignee of 

the present invention simultaneously with the present application: 

NPA001US, NPA002US, NPA003US, NPA004US, NPA005US, NPA006US, 
15 NPA007US, NPA008US, NPA009US, NPA010US, NPA012US, NPA016US, 

NPA017US, NPA018US, NPA019US, NPA020US, NPA021US, NPA030US, 
NPA035US, NPA048US, NPA050US, NPA051US, NPA052US, NPA075US, 
NPB001US, NPB002US, NPK002US, NPK003US, NPK004US, NPK005US, 
NPK007US, NPM001US, NPM002US, NPM003US, NPM004US, NPN001US, 
20 NPN002US, NPN003US, NPP001US, NPP002US, NPP003US, NPP005US, 

NPP006US, NPP007US, NPP008US, NPP016US, NPP017US, NPP018US, 
NPP019US, NPS001US, NPS003US, NPS020US, NPT001US, NPT002US, 
NPT003US, NPT004US, NPX001US, NPX003US, NPX008US, NPX011US, 
NPX014US, NPX016US, NPX020US, NPX022US, IJ52US, IJM52US, 
25 MJ10US, MJ1 1US, MJ12US, MJ13US, MJ14US, MJ15US, MJ34US, MJ47US, 

MJ52US, MJ58US, MJ62US, MJ63US, PAK04US, PAK05US, PAK06US, 
PAK07US, PAK08US, PEC01US, PEC02US. 

The disclosures of these co-pending applications are incorporated herein by cross- 
30 reference. Each application is temporarily identified by its docket number. This will be 
replaced by the corresponding USSN when available. 



NPT001US 



-2- 



BACKGROUND 

Presently, the majority of forms for accepting input from a user are either 
printed on paper or displayed on a computer screen. With a paper-based form, the user 
writes information on the form by hand, usually in spaces or boxes provided adjacent 
5 printed questions or instructions. The information is later keyed into a computer system 
for online storage processing and storage. With a screen-based form, the user enters 
information into the form via a computer keyboard. Both of these approaches have 
disadvantages. The screen-based approach is convenient to the form recipient, but 
requires that the user have access to a computer system. By contrast, the paper-based 
10 approach only requires the user to apply pen to paper, but requires the form recipient to 
wait for manual delivery of the form and perform data entry of its content. 

OBJECT 

It is an object of the present invention to combine advantages of paper-based 
forms and immediate online information capture. 

1 5 SUMMARY OF INVENTION 

According to a first aspect of the present invention, there is provided a region 

defined in relation to a surface, coded data being disposed within the region, wherein the 
coded data is indicative of: 

a region identity associated with the region; and 
20 a plurality of points within the region. 

According to a second aspect of the present invention, there is provided a 
surface, the surface including a region according to the first aspect. 

According to a third aspect of the present invention, there is provided a method 
of producing a surface having a region, the method including the steps of: 

25 (a) defining coded data, the coded data being indicative of: 

a region identity associated with the region; and 

a plurality of points within the region; 

(b) disposing the coded data within a region on the surface. 
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Preferably, the coded data includes at least one tag, each tag being indicative of 
the region identity and the position of the tag within the region. More preferably, each of 
the tags includes first identity data defining a relative position of that tag, and second 
identity data identifying the surface. 

5 In a particularly preferred form, the tags are printed onto a surface in the form 

of a piece of paper, and are configured to be read by a sensing device in the form of an 
optical sensing stylus. The tags are preferably printed using an ink that absorbs near 
infrared light but is substantially invisible to a human viewer under normal lighting 
conditions. When a user brings a sensing end of the stylus close to the surface, one or 
10 more of the tags are imaged, interpreted and decoded to provide an indication of the 
identity of the region from which the tag was imaged, and a position of the read tag 
within the region. This information can be used in a number of useful ways, including 
controlling a computer via interactive elements or providing input to a computer for 
storage, manipulation or display. 

15 Further aspects of the invention will become apparent from reading the 

following detailed description of preferred and other embodiments of the invention. 
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BRIEF DESCRIPTION OF DRAWINGS 

Preferred and other embodiments of the invention will now be described, by 

way of non-limiting example only, with reference to the accompanying drawings, in 
which: 

5 Figure 1 is a schematic of a the relationship between a sample printed netpage and its 
online page description; 

Figure 2 is a schematic view of a interaction between a netpage pen, a netpage printer, a 
netpage page server, and a netpage application server; 

Figure 3 illustrates a collection of netpage servers and printers interconnected via a 
1 0 network; 

Figure 4 is a schematic view of a high-level structure of a printed netpage and its online 
page description; 

Figure 5 is a plan view showing a structure of a netpage tag; 

Figure 6 is a plan view showing a relationship between a set of the tags shown in Figure 
15 5 and a field of view of a netpage sensing device in the form of a netpage pen; 

Figure 7 is a flowchart of a tag image processing and decoding algorithm; 

Figure 8 is a perspective view of a netpage pen and its associated tag-sensing field-of- 

view cone; 

Figure 9 is a perspective exploded view of the netpage pen shown in Figure 8; 

20 Figure 10 is a schematic block diagram of a pen controller for the netpage pen shown in 
Figures 8 and 9; 

Figure 1 1 is a perspective view of a wall-mounted netpage printer; 

Figure 12 is a section through the length of the netpage printer of Figure 11; 

Figure 12a is an enlarged portion of Figure 12 showing a section of the duplexed print 
25 engines and glue wheel assembly; 

Figure 13 is a detailed view of the ink cartridge, ink, air and glue paths, and print engines 
of the netpage printer of Figures 1 1 and 12; 

Figure 14 is a schematic block diagram of a printer controller for the netpage printer 
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shown in Figures 1 1 and 12; 

Figure 15 is a schematic block diagram of duplexed print engine controllers and 
Memjet™ printheads associated with the printer controller shown in Figure 14; 

Figure 16 is a schematic block diagram of the print engine controller shown in Figures 
5 14 and 15; 

Figure 17 is a perspective view of a single Memjet™ printing element, as used in, for 
example, the netpage printer of Figures 10 to 12; 

Figure 18 is a perspective view of a small part of an array of Memjet™ printing 
elements; 

1 0 Figure 19 is a series of perspective views illustrating the operating cycle of the Memjet™ 
printing element shown in Figure 13; 

Figure 20 is a perspective view of a short segment of a pagewidth Memjet™ printhead; 

Figure 21 is a schematic view of a user class diagram; 

Figure 22 is a schematic view of a printer class diagram; 
1 5 Figure 23 is a schematic view of a pen class diagram; 

Figure 24 is a schematic view of an application class diagram; 

Figure 25 is a schematic view of a document and page description class diagram; 

Figure 26 is a schematic view of a document and page ownership class diagram; 

Figure 27 is a schematic view of a terminal element specialization class diagram; 
20 Figure 28 is a schematic view of a static element specialization class diagram; 

Figure 29 is a schematic view of a hyperlink element class diagram; 

Figure 30 is a schematic view of a hyperlink element specialization class diagram; 

Figure 3 1 is a schematic view of a hyperlinked group class diagram; 

Figure 32 is a schematic view of a form class diagram; 
25 Figure 33 is a schematic view of a digital ink class diagram; 

Figure 34 is a schematic view of a field element specialization class diagram; 
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Figure 35 is a schematic view of a checkbox field class diagram; 

Figure 36 is a schematic view of a text field class diagram; 

Figure 37 is a schematic view of a signature field class diagram; 

Figure 38 is a flowchart of an input processing algorithm; 
5 Figure 38a is a detailed flowchart of one step of the flowchart of Figure 38; 

Figure 39 is a schematic view of a page server command element class diagram; 

Figure 40 is a schematic view of a resource description class diagram; 

Figure 41 is a schematic view of a favorites list class diagram; 

Figure 42 is a schematic view of a history list class diagram; 
1 0 Figure 43 is a schematic view of a subscription delivery protocol; 

Figure 44 is a schematic view of a hyperlink request class diagram; 

Figure 45 is a schematic view of a hyperlink activation protocol; 

Figure 46 is a schematic view of a form submission protocol; 

Figure 47 is a schematic view of a commission payment protocol; 
1 5 Figure 48 is a schematic view of a set of radial wedges making up a symbol; 

Figure 49 is a schematic view of a ring A and B symbol allocation scheme; 

Figure 50 is a schematic view of a first ring C and D symbol allocation scheme; 

Figure 51 is a schematic view of a second ring C and D symbol allocation scheme; 

Figure 52 is a schematic view of a triangular tag packing; 
20 Figure 53 is a perspective view of an icosahedron; 

Figure 54 is a perspective view of an icosahedral geodesic with frequency 3; 

Figure 55 is a schematic view of a minimum tag spacing; 

Figure 56 is a schematic view of a minimum tag spacing which avoids overlap; 

Figure 57 is a schematic view of a first tag insertion case; 
25 Figure 58 is a schematic view of a second tag insertion case; 
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Figure 59 is a schematic view of a third tag insertion case; 

Figure 60 is a schematic view of a fourth tag insertion case; 

Figure 61 is a schematic view of a pen orientation relative to a surface; 

Figure 62 is a schematic view of a pen pitch geometry; 

5 Figure 63 is a schematic view of a pen roll geometry; 

Figure 64 is a schematic view of a pen coordinate space showing physical and optical 
axes of a pen; 

Figure 65 is a schematic view of a curved nib geometry; 

Figure 66 is a schematic view of an interaction between sampling frequency and tag 
1 0 frequency; 

Figure 67 is a table containing equations numbered 1 to 10; 

Figure 68 is a table containing equations numbered 1 1 to 20; 

Figure 69 is a table containing equations numbered 21 to 26; 

Figure 70 is a table containing equations numbered 27 to 34; 
1 5 Figure 71 is a table containing equations numbered 35 to 41 ; 

Figure 72 is a table containing equations numbered 42 to 44; 

Figure 73 is a table containing equations numbered 45 to 47; 

Figure 74 is a table containing equations numbered 48 to 51; 

Figure 75 is a table containing equations numbered 52 to 54; 
20 Figure 76 is a table containing equations numbered 55 to 57; 

Figure 77 is a table containing equations numbered 58 to 59; 

Figure 78 is a table containing equations numbered 60 to 63; 

Figure 79 is a table containing equations numbered 64 to 74; 

Figure 80 is a table containing equations numbered 75 to 86; 
25 Figure 81 is a table containing equations numbered 87 to 99; 

Figure 82 is a table containing equations numbered 100 to 1 1 1 ; 
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Figure 83 is a table containing equations numbered 1 12 to 120; 

Figure 84 is a table containing equations numbered 121 to 129; 

Figure 85 is a table containing a set of degenerate forms of equations 64 to 71 ; 

Figure 86 is a first part of a table containing conditions and special handling for 
pitch and zero roll; and 

Figure 87 is a the second part of the table of Figure 86. 
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DETAILED DESCRIPTION OF PREFERRED AND OTHER EMBODIMENTS 

Note: Memjet™ is a trade mark of Silverbrook Research Pty Ltd, Australia. 

In the preferred embodiment, the invention is configured to work with the 
netpage networked computer system, a detailed overview of which follows. It will be 
5 appreciated that not every implementation will necessarily embody all or even most of 
the specific details and extensions discussed below in relation to the basic system. 
However, the system is described in its most complete form to reduce the need for 
external reference when attempting to understand the context in which the preferred 
embodiments and aspects of the present invention operate. 

10 In brief summary, the preferred form of the netpage system employs a computer 

interface in the form of a mapped surface, that is, a physical surface which contains 
references to a map of the surface maintained in a computer system. The map references 
can be queried by an appropriate sensing device. Depending upon the specific 
implementation, the map references may be encoded visibly or invisibly, and defined in 

15 such a way that a local query on the mapped surface yields an unambiguous map 
reference both within the map and among different maps. The computer system can 
contain information about features on the mapped surface, and such information can be 
retrieved based on map references supplied by a sensing device used with the mapped 
surface. The information thus retrieved can take the form of actions which are initiated 

20 by the computer system on behalf of the operator in response to the operator's interaction 
with the surface features. 

In its preferred form, the netpage system relies on the production of, and human 
interaction with, netpages. These are pages of text, graphics and images printed on 
ordinary paper, but which work like interactive web pages. Information is encoded on 
25 each page using ink which is substantially invisible to the unaided human eye. The ink, 
however, and thereby the coded data, can be sensed by an optically imaging pen and 
transmitted to the netpage system. 

In the preferred form, active buttons and hyperlinks on each page can be 
clicked with the pen to request information from the network or to signal preferences to a 
30 network server. In one embodiment, text written by hand on a netpage is automatically 
recognized and converted to computer text in the netpage system, allowing forms to be 
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filled in. In other embodiments, signatures recorded on a netpage are automatically 
verified, allowing e-commerce transactions to be securely authorized. 

As illustrated in Figure 1, a printed netpage 1 can represent a interactive form 
which can be filled in by the user both physically, on the printed page, and 
5 "electronically", via communication between the pen and the netpage system. The 
example shows a "Request" form containing name and address fields and a submit 
button. The netpage consists of graphic data 2 printed using visible ink, and coded data 3 
printed as a collection of tags 4 using invisible ink. The corresponding page description 
5, stored on the netpage network, describes the individual elements of the netpage. In 
10 particular it describes the type and spatial extent (zone) of each interactive element (i.e. 
text field or button in the example), to allow the netpage system to correctly interpret 
input via the netpage. The submit button 6, for example, has a zone 7 which corresponds 
to the spatial extent of the corresponding graphic 8. 

As illustrated in Figure 2, the netpage pen 101, a preferred form of which is 
1 5 shown in Figures 8 and 9 and described in more detail below, works in conjunction with 
a netpage printer 601, an Internet-connected printing appliance for home, office or 
mobile use. The pen is wireless and communicates securely with the netpage printer via 
a short-range radio link 9. 

The netpage printer 601, a preferred form of which is shown in Figures 11 to 
20 13 and described in more detail below, is able to deliver, periodically or on demand, 
personalized newspapers, magazines, catalogs, brochures and other publications, all 
printed at high quality as interactive netpages. Unlike a personal computer, the netpage 
' printer is an appliance which can be, for example, wall-mounted adjacent to an area 
where the morning news is first consumed, such as in a user's kitchen, near a breakfast 
25 table, or near the household's point of departure for the day. It also comes in tabletop, 
desktop, portable and miniature versions. 

Netpages printed at their point of consumption combine the ease-of-use of 
paper with the timeliness and interactivity of an interactive medium. 

As shown in Figure 2, the netpage pen 101 interacts with the coded data on a 
30 printed netpage 1 and communicates, via a short-range radio link 9, the interaction to a 
netpage printer. The printer 601 sends the interaction to the relevant netpage page server 
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10 for interpretation. In appropriate circumstances, the page server sends a corresponding 
message to application computer software running on a netpage application server 13. 
The application server may in turn send a response which is printed on the originating 
printer. 

5 The netpage system is made considerably more convenient in the preferred 

embodiment by being used in conjunction with high-speed microelectromechanical 
system (MEMS) based inkjet (Memjet™) printers. In the preferred form of this 
technology, relatively high-speed and high-quality printing is made more affordable to 
consumers. In its preferred form, a netpage publication has the physical characteristics of 
10 a traditional newsmagazine, such as a set of letter-size glossy pages printed in full color 
on both sides, bound together for easy navigation and comfortable handling. 

The netpage printer exploits the growing availability of broadband Internet 
access. Cable service is available to 95% of households in the United States, and cable 
modem service offering broadband Internet access is already available to 20% of these. 

1 5 The netpage printer can also operate with slower connections, but with longer delivery 
times and lower image quality. Indeed, the netpage system can be enabled using existing 
consumer inkjet and laser printers, although the system will operate more slowly and will 
therefore be less acceptable from a consumer's point of view. In other embodiments, the 
netpage system is hosted on a private intranet. In still other embodiments, the netpage 

20 system is hosted on a single computer or computer-enabled device, such as a printer. 

Netpage publication servers 14 on the netpage network are configured to 
deliver print-quality publications to netpage printers. Periodical publications are 
delivered automatically to subscribing netpage printers via pointcasting and multicasting 
Internet protocols. Personalized publications are filtered and formatted according to 
25 individual user profiles. 

A netpage printer can be configured to support any number of pens, and a pen 
can work with any number of netpage printers. In the preferred implementation, each 
netpage pen has a unique identifier. A household may have a collection of colored 
netpage pens, one assigned to each member of the family. This allows each user to 
30 maintain a distinct profile with respect to a netpage publication server or application 
server. 
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A netpage pen can also be registered with a netpage registration server 1 1 and 
linked to one or more payment card accounts. This allows e-commerce payments to be 
securely authorized using the netpage pen. The netpage registration server compares the 
signature captured by the netpage pen with a previously registered signature, allowing it 
5 to authenticate the user's identity to an e-commerce server. Other biometrics can also be 
used to verify identity. A version of the netpage pen includes fingerprint scanning, 
verified in a similar way by the netpage registration server. 

Although a netpage printer may deliver periodicals such as the morning 
newspaper without user intervention, it can be configured never to deliver unsolicited 
1 0 junk mail. In its preferred form, it only delivers periodicals from subscribed or otherwise 
authorized sources. In this respect, the netpage printer is unlike a fax machine or e-mail 
account which is visible to any junk mailer who knows the telephone number or email 
address. 

1 Netpage System Architecture 

15 Each object model in the system is described using a Unified Modeling 

Language (UML) class diagram. A class diagram consists of a set of object classes 
connected by relationships, and two kinds of relationships are of interest here: 
associations and generalizations. An association represents some kind of relationship 
between objects, i.e. between instances of classes. A generalization relates actual classes, 
20 and can be understood in the following way: if a class is thought of as the set of all 
objects of that class, and class A is a generalization of class B, then B is simply a subset 
of A. The UML does not directly support second-order modelling - i.e. classes of classes. 

Each class is drawn as a rectangle labelled with the name of the class. It 
contains a list of the attributes of the class, separated from the name by a horizontal line, 
25 and a list of the operations of the class, separated from the attribute list by a horizontal 
line. In the class diagrams which follow, however, operations are never modelled. 

An association is drawn as a line joining two classes, optionally labelled at 
either end with the multiplicity of the association. The default multiplicity is one. An 
asterisk (*) indicates a multiplicity of "many", i.e. zero or more. Each association is 
30 optionally labelled with its name, and is also optionally labelled at either end with the 
role of the corresponding class. An open diamond indicates an aggregation association 
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("is-part-of '), and is drawn at the aggregator end of the association line. 

A generalization relationship ("is-a") is drawn as a solid line joining two 
classes, with an arrow (in the form of an open triangle) at the generalization end. 

When a class diagram is broken up into multiple diagrams, any class which is 
5 duplicated is shown with a dashed outline in all but the main diagram which defines it. It 
is shown with attributes only where it is defined. 

1.1 Netpages 

Netpages are the foundation on which a netpage network is built. They provide 

a paper-based user interface to published information and interactive services. 

10 A netpage consists of a printed page (or other surface region) invisibly tagged 

with references to an online description of the page. The online page description is 
maintained persistently by a netpage page server. The page description describes the 
visible layout and content of the page, including text, graphics and images. It also 
describes the input elements on the page, including buttons, hyperlinks, and input fields. 

15 A netpage allows markings made with a netpage pen on its surface to be simultaneously 
captured and processed by the netpage system. 

Multiple netpages can share the same page description. However, to allow 
input through otherwise identical pages to be distinguished, each netpage is assigned a 
unique page identifier. This page ID has sufficient precision to distinguish between a 
20 very large number of netpages. 

Each reference to the page description is encoded in a printed tag. The tag 
identifies the unique page on which it appears, and thereby indirectly identifies the page 
description. The tag also identifies its own position on the page. Characteristics of the 
tags are described in more detail below. 

25 Tags are printed in infrared-absorptive ink on any substrate which is infrared- 

reflective, such as ordinary paper. Near-infrared wavelengths are invisible to the human 
eye but are easily sensed by a solid-state image sensor with an appropriate filter. 

A tag is sensed by an area image sensor in the netpage pen, and the tag data is 
transmitted to the netpage system via the nearest netpage printer. The pen is wireless and 
30 communicates with the netpage printer via a short-range radio link. Tags are sufficiently 
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small and densely arranged that the pen can reliably image at least one tag even on a 
single click on the page. It is important that the pen recognize the page ID and position 
on every interaction with the page, since the interaction is stateless. Tags are error- 
correctably encoded to make them partially tolerant to surface damage. 

5 The netpage page server maintains a unique page instance for each printed 

netpage, allowing it to maintain a distinct set of user-supplied values for input fields in 
the page description for each printed netpage. 

The relationship between the page description, the page instance, and the 
printed netpage is shown in Figure 4. The page instance is associated with both the 
1 0 netpage printer which printed it and, if known, the netpage user who requested it. 

1.2 Netpage Tags 

1.2.1 Tag Data Content 

In a preferred form, each tag identifies the region in which it appears, and the 

location of that tag within the region. A tag may also contain flags which relate to the 
1 5 region as a whole or to the tag. One or more flag bits may, for example, signal a tag 
sensing device to provide feedback indicative of a function associated with the 
immediate area of the tag, without the sensing device having to refer to a description of 
the region. A netpage pen may, for example, illuminate an "active area" LED when in 
the zone of a hyperlink. 

20 As will be more clearly explained below, in a preferred embodiment, each tag 

contains an easily recognized invariant structure which aids initial detection, and which 
assists in minimizing the effect of any warp induced by the surface or by the sensing 
process. The tags preferably tile the entire page, and are sufficiently small and densely 
arranged that the pen can reliably image at least one tag even on a single click on the 

25 page. It is important that the pen recognize the page ID and position on every interaction 
with the page, since the interaction is stateless. 

In a preferred embodiment, the region to which a tag refers coincides with an 
entire page, and the region ID encoded in the tag is therefore synonymous with the page 
ID of the page on which the tag appears. In other embodiments, the region to which a tag 
30 refers can be an arbitrary subregion of a page or other surface. For example, it can 
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coincide with the zone of an interactive element, in which case the region ID can directly 
identify the interactive element. 

Table l.Tag data 



Field 


Precision (bits) 


Region ID 


100 


Tag ID 


16 


Flags 


4 


Total 


120 



5 Each tag contains 120 bits of information, typically allocated as shown in Table 

1. Assuming a maximum tag density of 64 per square inch, a 16-bit tag ID supports a 
region size of up to 1024 square inches. Larger regions can be mapped continuously 
without increasing the tag ID precision simply by using abutting regions and maps. The 
100-bit region ID allows 2 100 (~10 30 or a million trillion trillion) different regions to be 
1 0 uniquely identified. 

1.2.2 Tag Data Encoding 

The 120 bits of tag data are redundantly encoded using a (15, 5) Reed-Solomon 

code. This yields 360 encoded bits consisting of 6 codewords of 15 4-bit symbols each. 
The (15, 5) code allows up to 5 symbol errors to be corrected per codeword, i.e. it is 
1 5 tolerant of a symbol error rate of up to 33% per codeword. 

Each 4-bit symbol is represented in a spatially coherent way in the tag, and the 
symbols of the six codewords are interleaved spatially within the tag. This ensures that a 
burst error (an error affecting multiple spatially adjacent bits) damages a minimum 
number of symbols overall and a minimum number of symbols in any one codeword, 
20 thus maximising the likelihood that the burst error can be fully corrected. 

1.2.3 Physical Tag Structure 

The physical representation of the tag, shown in Figure 5, includes fixed target 

structures 15, 16, 17 and variable data areas 18. The fixed target structures allow a 
sensing device such as the netpage pen to detect the tag and infer its three-dimensional 
25 orientation relative to the sensor. The data areas contain representations of the individual 
bits of the encoded tag data. 
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To achieve proper tag reproduction, the tag is rendered at a resolution of 
256x256 dots. When printed at 1600 dots per inch this yields a tag with a diameter of 
about 4 mm. At this resolution the tag is designed to be surrounded by a "quiet area" of 
radius 16 dots. Since the quiet area is also contributed by adjacent tags, it only adds 16 
5 dots to the effective diameter of the tag. 

The tag includes six target structures: a detection ring 15; an orientation axis 
target 16; and four perspective targets 17. 

The detection ring 15 allows the sensing device to initially detect the tag 4. The 
ring is easy to detect because it is rotationally invariant and because a simple correction 

10 of its aspect ratio removes most of the effects of perspective distortion. The orientation 
axis 16 allows the sensing device to determine the approximate planar orientation of the 
tag due to the yaw of the sensor. The orientation axis is skewed to yield a unique 
orientation. The four perspective targets 17 allow the sensing device to infer an accurate 
two-dimensional perspective transform of the tag and hence an accurate three- 

1 5 dimensional position and orientation of the tag relative to the sensor. 

All target structures are redundantly large to improve their immunity to noise. 

The overall tag shape is circular. This supports, amongst other things, optimal 
tag packing on an irregular triangular grid. In combination with the circular detection 
ring 15, this makes a circular arrangement of data bits within the tag optimal. To 

20 maximise its size, each data bit is represented by a radial wedge 510 in the form of an 
area bounded by two radial lines 512, a radially inner arc 514 and a radially outer arc 
516. Each wedge 510 has a minimum dimension of 8 dots at 1600 dpi and is designed so 
that its base (i.e. its inner arc 514), is at least equal to this minimum dimension. The 
radial height of the wedge 510 is always equal to the minimum dimension. Each 4-bit 

25 data symbol is represented by an array 518 of 2x2 wedges 510, as best shown in Figure 
48. 

The 15 4-bit data symbols of each of the six codewords are allocated to the four 
concentric symbol rings 18a to 18d, shown in Figure 5, in interleaved fashion as shown 
in Figures 49 to 51. Symbols of first to sixth codewords 520-525 are allocated alternately 
30 in circular progression around the tag. 
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The interleaving is designed to maximise the average spatial distance between 
any two symbols of the same codeword. 

In order to support "single-click" interaction with a tagged region via a sensing 
device, the sensing device must be able to see at least one entire tag in its field of view 
5 no matter where in the region or at what orientation it is positioned. The required 
diameter of the field of view of the sensing device is therefore a function of the size and 
spacing of the tags. 

Assuming a circular tag shape, the minimum diameter of the sensor field of 
view is obtained when the tags are tiled on a equilateral triangular grid, as shown in 
10 Figure 6. 

1.2.4 Tag Image Processing and Decoding 

The tag image processing and decoding performed by a sensing device such as 

the netpage pen is shown in Figure 7. While a captured image is being acquired from the 
image sensor, the dynamic range of the image is determined (at 20). The center of the 
15 range is then chosen as the binary threshold for the image 21. The image is then 
thresholded and segmented into connected pixel regions (i.e. shapes 23) (at 22). Shapes 
which are too small to represent tag target structures are discarded. The size and centroid 
of each shape is also computed. 

Binary shape moments 25 are then computed (at 24) for each shape, and these 
20 provide the basis for subsequently locating target structures. Central shape moments are 
by their nature invariant of position, and can be easily made invariant of scale, aspect 
ratio and rotation. 

The ring target structure 15 is the first to be located (at 26). A ring has the 
advantage of being very well behaved when perspective-distorted. Matching proceeds by 
25 aspect-normalizing and rotation-normalizing each shape's moments. Once its second- 
order moments are normalized the ring is easy to recognize even if the perspective 
distortion was significant. The ring's original aspect and rotation 27 together provide a 
useful approximation of the perspective transform. 

The axis target structure 16 is the next to be located (at 28). Matching proceeds 
30 by applying the ring's normalizations to each shape's moments, and rotation-normalizing 
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the resulting moments. Once its second-order moments are normalized the axis target is 
easily recognized. Note that one third order moment is required to disambiguate the two 
possible orientations of the axis. The shape is deliberately skewed to one side to make 
this possible. Note also that it is only possible to rotation-normalize the axis target after 
5 it has had the ring's normalizations applied, since the perspective distortion can hide the 
axis target's axis. The axis target's original rotation provides a useful approximation of 
the tag's rotation due to pen yaw 29. 

The four perspective target structures 17 are the last to be located (at 30). Good 
estimates of their positions are computed based on their known spatial relationships to 

10 the ring and axis targets, the aspect and rotation of the ring, and the rotation of the axis. 
Matching proceeds by applying the ring's normalizations to each shape's moments. Once 
their second-order moments are normalized the circular perspective targets are easy to 
recognize, and the target closest to each estimated position is taken as a match. The 
original centroids of the four perspective targets are then taken to be the perspective- 

1 5 distorted comers 3 1 of a square of known size in tag space, and an eight-degree-of- 
freedom perspective transform 33 is inferred (at 32) based on solving the well- 
understood equations relating the four tag-space and image-space point pairs. 

The inferred tag-space to image-space perspective transform is used to project 
(at 36) each known data bit position in tag space into image space where the real-valued 
20 position is used to bilinearly interpolate (at 36) the four relevant adjacent pixels in the 
input image. The previously computed image threshold 21 is used to threshold the result 
to produce the final bit value 37. 

Once all 360 data bits 37 have been obtained in this way, each of the six 60-bit 
Reed-Solomon codewords is decoded (at 38) to yield 20 decoded bits 39, or 120 decoded 
25 bits in total. Note that the codeword symbols are sampled in codeword order, so that 
codewords are implicitly de-interleaved during the sampling process. 

The ring target 1 5 is only sought in a subarea of the image whose relationship 
to the image guarantees that the ring, if found, is part of a complete tag. If a complete tag 
is not found and successfully decoded, then no pen position is recorded for the current 
30 frame. Given adequate processing power and ideally a non-minimal field of view 193, an 
alternative strategy involves seeking another tag in the current image. 
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The obtained tag data indicates the identity of the region containing the tag and 
the position of the tag within the region. An accurate position 35 of the pen nib in the 
region, as well as the overall orientation 35 of the pen, is then inferred (at 34) from the 
perspective transform 33 observed on the tag and the known spatial relationship between 
5 the pen's physical axis and the pen's optical axis. 

1.2.5 Tag Map 

Decoding a tag results in a region ID, a tag ID, and a tag-relative pen transform. 
Before the tag ID and the tag-relative pen location can be translated into an absolute 
location within the tagged region, the location of the tag within the region must be 
1 0 known. This is given by a tag map, a function which maps each tag ID in a tagged region 
to a corresponding location. The tag map class diagram is shown in Figure 22, as part of 
the netpage printer class diagram. 

A tag map reflects the scheme used to tile the surface region with tags, and this 
can vary according to surface type. When multiple tagged regions share the same tiling 
1 5 scheme and the same tag numbering scheme, they can also share the same tag map. 

The tag map for a region must be retrievable via the region ID. Thus, given a 
region ID, a tag ID and a pen transform, the tag map can be retrieved, the tag ID can be 
translated into an absolute tag location within the region, and the tag-relative pen 
location can be added to the tag location to yield an absolute pen location within the 
20 region. 

1.2.6 Tagging Schemes 

Two distinct surface coding schemes are of interest, both of which use the tag 

structure described earlier in this section. The preferred coding scheme uses "location- 
indicating" tags as already discussed. An alternative coding scheme uses object- 
25 indicating tags. 

A location-indicating tag contains a tag ID which, when translated through the 
tag map associated with the tagged region, yields a unique tag location within the region. 
The tag-relative location of the pen is added to this tag location to yield the location of 
the pen within the region. This in turn is used to determine the location of the pen 
30 relative to a user interface element in the page description associated with the region. 
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Not only is the user interface element itself identified, but a location relative to the user 
interface element is identified. Location-indicating tags therefore trivially support the 
capture of an absolute pen path in the zone of a particular user interface element. 

An object-indicating tag contains a tag ID which directly identifies a user 
5 interface element in the page description associated with the region. All the tags in the 
zone of the user interface element identify the user interface element, making them all 
identical and therefore indistinguishable. Object-indicating tags do not, therefore, 
support the capture of an absolute pen path. They do, however, support the capture of a 
relative pen path. So long as the position sampling frequency exceeds twice the 
1 0 encountered tag frequency, the displacement from one sampled pen position to the next 
within a stroke can be unambiguously determined. 

Assume a sampling wavelength of Xs and a tag wavelength of X T , with a 
relationship as defined in EQ 38. For two adjacent position samples P, and P M , one of 
EQ 39 and EQ 40 will hold. 

1 5 Assuming both equations hold leads to the relationship defined in EQ 41. 

Since EQ 41 contradicts EQ 38, the assumption that both EQ 39 and EQ 40 
hold must be incorrect, and the choice is therefore unambiguous, as stated. 

The illustration in Figure 60 shows four tags 500 and a one-dimensional stroke 
of six sample positions 582 which satisfy EQ 38. Possible aliases 584 of the sample 
20 positions are also shown. From inspection, if the distance from one sample position to 
the next is X s , then the distance from a sample position to the alias of the next sample 
position exceeds Xs. 

If the tag wavelength X T is 4.7 mm, as discussed in earlier, then the sampling 
wavelength X s must be less than 2.35 mm. If the temporal sampling frequency is 100 Hz 
25 as required for accurate handwriting recognition, then the pen speed must be less than 
235 mm/s to satisfy EQ 38. 

With either tagging scheme, the tags function in cooperation with associated 
visual elements on the netpage as user interactive elements in that a user can interact 
with the printed page using an appropriate sensing device in order for tag data to be read 
30 by the sensing device and for an appropriate response to be generated in the netpage 
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system. 

1 .3 Document and Page Descriptions 

A preferred embodiment of a document and page description class diagram is 

shown in Figures 25 and 26. 

5 In the netpage system a document is described at three levels. At the most 

abstract level the document 836 has a hierarchical structure whose terminal elements 839 
are associated with content objects 840 such as text objects, text style objects, image 
objects, etc. Once the document is printed on a printer with a particular page size and 
according to a particular user's scale factor preference, the document is paginated and 

10 otherwise formatted. Formatted terminal elements 835 will in some cases be associated 
with content objects which are different from those associated with their corresponding 
terminal elements, particularly where the content objects are style-related. Each printed 
instance of a document and page is also described separately, to allow input captured 
through a particular page instance 830 to be recorded separately from input captured 

1 5 through other instances of the same page description. 

The presence of the most abstract document description on the page server 
allows a user to request a copy of a document without being forced to accept the source 
document's specific format. The user may be requesting a copy through a printer with a 
different page size, for example. Conversely, the presence of the formatted document 
20 description on the page server allows the page server to efficiently interpret user actions 
on a particular printed page. 

A formatted document 834 consists of a set of formatted page descriptions 5, 
each of which consists of a set of formatted terminal elements 835. Each formatted 
element has a spatial extent or zone 58 on the page. This defines the active area of input 
25 elements such as hyperlinks and input fields. 

A document instance 83 1 corresponds to a formatted document 834. It consists 
of a set of page instances 830, each of which corresponds to a page description 5 of the 
formatted document. Each page instance 830 describes a single unique printed netpage 1, 
and records the page ID 50 of the netpage. A page instance is not part of a document 
30 instance if it represents a copy of a page requested in isolation. 
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A page instance consists of a set of terminal element instances 832. An element 
instance only exists if it records instance-specific information. Thus, a hyperlink instance 
exists for a hyperlink element because it records a transaction ID 55 which is specific to 
the page instance, and a field instance exists for a field element because it records input 
5 specific to the page instance. An element instance does not exist, however, for static 
elements such as textflows. 

A terminal element can be a static element 843, a hyperlink element 844, a 
field element 845 or a page server command element 846, as shown in Figure 27. A 
static element 843 can be a style element 847 with an associated style object 854, a 
10 textflow element 848 with an associated styled text object 855, an image element 849 
with an associated image element 856, a graphic element 850 with an associated graphic 
object 857, a video clip element 851 with an associated video clip object 858, an audio 
clip element 852 with an associated audio clip object 859, or a script element 853 with 
an associated script object 860, as shown in Figure 28. 

15 A page instance has a background field 833 which is used to record any digital 

ink captured on the page which does not apply to a specific input element. 

In the preferred form of the invention, a tag map 81 1 is associated with each 
page instance to allow tags on the page to be translated into locations on the page. 

1 .4 The Netpage Network 

20 In a preferred embodiment, a netpage network consists of a distributed set of 

netpage page servers 10, netpage registration servers 11, netpage ID servers 12, netpage 
application servers 13, netpage publication servers 14, and netpage printers 601 
connected via a network 19 such as the Internet, as shown in Figure 3. 

The netpage registration server 11 is a server which records relationships 
25 between users, pens, printers, applications and publications, and thereby authorizes 
various network activities. It authenticates users and acts as a signing proxy on behalf of 
authenticated users in application transactions. It also provides handwriting recognition 
services. As described above, a netpage page server 10 maintains persistent information 
about page descriptions and page instances. The netpage network includes any number of 
30 page servers, each handling a subset of page instances. Since a page server also 
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maintains user input values for each page instance, clients such as netpage printers send 
netpage input directly to the appropriate page server. The page server interprets any such 
input relative to the description of the corresponding page. 

A netpage ID server 12 allocates document IDs 51 on demand, and provides 
5 load-balancing of page servers via its ID allocation scheme. 

A netpage printer uses the Internet Distributed Name System (DNS), or similar, 
to resolve a netpage page ID 50 into the network address of the netpage page server 
handling the corresponding page instance. 

A netpage application server 13 is a server which hosts interactive netpage 
10 applications. A netpage publication server 14 is an application server which publishes 
netpage documents to netpage printers. They are described in detail in Section 2. 

Netpage servers can be hosted on a variety of network server platforms from 
manufacturers such as IBM, Hewlett-Packard, and Sun. Multiple netpage servers can run 
concurrently on a single host, and a single server can be distributed over a number of 
1 5 hosts. Some or all of the functionality provided by netpage servers, and in particular the 
functionality provided by the ID server and the page server, can also be provided directly 
in a netpage appliance such as a netpage printer, in a computer workstation, or on a local 
network. 

1 .5 The Netpage Printer 

20 The netpage printer 601 is an appliance which is registered with the netpage 

system and prints netpage documents on demand and via subscription. Each printer has a 
unique printer ID 62, and is connected to the netpage network via a network such as the 
Internet, ideally via a broadband connection. 

Apart from identity and security settings in non-volatile memory, the netpage 
25 printer contains no persistent storage. As far as a user is concerned, "the network is the 
computer". Netpages function interactively across space and time with the help of the 
distributed netpage page servers 10, independently of particular netpage printers. 

The netpage printer receives subscribed netpage documents from netpage 
publication servers 14. Each document is distributed in two parts: the page layouts, and 
30 the actual text and image objects which populate the pages. Because of personalization, 
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page layouts are typically specific to a particular subscriber and so are pointcast to the 
subscriber's printer via the appropriate page server. Text and image objects, on the other 
hand, are typically shared with other subscribers, and so are multicast to all subscribers' 
printers and the appropriate page servers. 

5 The netpage publication server optimizes the segmentation of document 

content into pointcasts and multicasts. After receiving the pointcast of a document's page 
layouts, the printer knows which multicasts, if any, to listen to. 

Once the printer has received the complete page layouts and objects that define 
the document to be printed, it can print the document. 

10 The printer rasterizes and prints odd and even pages simultaneously on both 

sides of the sheet. It contains duplexed print engine controllers 760 and print engines 
utilizing Memjet™ printheads 350 for this purpose. 

The printing process consists of two decoupled stages: rasterization of page 
descriptions, and expansion and printing of page images. The raster image processor 
15 (RIP) consists of one or more standard DSPs 757 running in parallel. The duplexed print 
engine controllers consist of custom processors which expand, dither and print page 
images in real time, synchronized with the operation of the printheads in the print 
engines. 

Printers not enabled for IR printing have the option to print tags using IR- 
20 absorptive black ink, although this restricts tags to otherwise empty areas of the page. 
Although such pages have more limited functionality than IR-printed pages, they are still 
classed as netpages. 

A normal netpage printer prints netpages on sheets of paper. More specialised 
netpage printers may print onto more specialised surfaces, such as globes. Each printer 
25 supports at least one surface type, and supports at least one tag tiling scheme, and hence 
tag map, for each surface type. The tag map 811 which describes the tag tiling scheme 
actually used to print a document becomes associated with that document so that the 
document's tags can be correctly interpreted. 

Figure 2 shows the netpage printer class diagram, reflecting printer-related 
30 information maintained by a registration server 1 1 on the netpage network. 
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A preferred embodiment of the netpage printer is described in greater detail in 
Section 6 below, with reference to Figures 1 1 to 16. 

1.5-1 Memjet™ Printheads 

The netpage system can operate using printers made with a wide range of 

5 digital printing technologies, including thermal inkjet, piezoelectric inkjet, laser 
electrophotographic, and others. However, for wide consumer acceptance, it is desirable 
that a netpage printer have the following characteristics: 

• photographic quality color printing 

• high quality text printing 
10 • high reliability 

• low printer cost 

• low ink cost 

• low paper cost 

• simple operation 

15 • nearly silent printing 

• high printing speed 

• simultaneous double sided printing 

• compact form factor 

• low power consumption 

20 No commercially available printing technology has all of these characteristics. 

To enable to production of printers with these characteristics, the present 
applicant has invented a new print technology, referred to as Memjet™ technology. 
Memjet™ is a drop-on-demand inkjet technology that incorporates page width printheads 
fabricated using microelectromechanical systems (MEMS) technology. Figure 17 shows 
25 a single printing element 300 of a Memjet™ printhead. The netpage wallprinter 
incorporates 168960 printing elements 300 to form a 1600 dpi pagewidth duplex printer. 
This printer simultaneously prints cyan, magenta, yellow, black, and infrared inks as well 
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as paper conditioner and ink fixative. 

The printing element 300 is approximately 110 microns long by 32 microns 
wide. Arrays of these printing elements are formed on a silicon substrate 301 that 
incorporates CMOS logic, data transfer, timing, and drive circuits (not shown). 

5 Major elements of the printing element 300 are the nozzle 302, the nozzle rim 

303, the nozzle chamber 304, the fluidic seal 305, the ink channel rim 306, the lever arm 
307, the active actuator beam pair 308, the passive actuator beam pair 309, the active 
actuator anchor 310, the passive actuator anchor 311, and the ink inlet 312. 

The active actuator beam pair 308 is mechanically joined to the passive 
1 0 actuator beam pair 309 at the join 319. Both beams pairs are anchored at their respective 
anchor points 310 and 311. The combination of elements 308, 309, 310, 311, and 319 
form a cantilevered electrothermal bend actuator 320. 

Figure 18 shows a small part of an array of printing elements 300, including a 
cross section 315 of a printing element 300. The cross section 315 is shown without ink, 
15 to clearly show the ink inlet 312 that passes through the silicon wafer 301 . 

Figures 19(a), 19(b) and 19(c) show the operating cycle of a Memjet™ printing 
element 300. 

Figure 19(a) shows the quiescent position of the ink meniscus 316 prior to 
printing an ink droplet. Ink is retained in the nozzle chamber by surface tension at the ink 
20 meniscus 316 and at the fluidic seal 305 formed between the nozzle chamber 304 and the 
ink channel rim 306. 

While printing, the printhead CMOS circuitry distributes data from the print 
engine controller to the correct printing element, latches the data, and buffers the data to 
drive the electrodes 318 of the active actuator beam pair 308. This causes an electrical 

25 current to pass through the beam pair 308 for about one microsecond, resulting in Joule 
heating. The temperature increase resulting from Joule heating causes the beam pair 308 
to expand. As the passive actuator beam pair 309 is not heated, it does not expand, 
resulting in a stress difference between the two beam pairs. This stress difference is 
partially resolved by the cantilevered end of the electrothermal bend actuator 320 

30 bending towards the substrate 301. The lever arm 307 transmits this movement to the 
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nozzle chamber 304. The nozzle chamber 304 moves about two microns to the position 
shown in Figure 19(b). This increases the ink pressure, forcing ink 321 out of the nozzle 
302, and causing the ink meniscus 316 to bulge. The nozzle rim 303 prevents the ink 
meniscus 316 from spreading across the surface of the nozzle chamber 304. 

5 As the temperature of the beam pairs 308 and 309 equalizes, the actuator 320 

returns to its original position. This aids in the break-off of the ink droplet 317 from the 
ink 321 in the nozzle chamber, as shown in Figure 19(c). The nozzle chamber is refilled 
by the action of the surface tension at the meniscus 316. 

Figure 20 shows a segment of a printhead 350. In a netpage printer, the length 
10 of the printhead is the full width of the paper (typically 210 mm) in the direction 351. 
The segment shown is 0.4 mm long (about 0.2% of a complete printhead). When 
printing, the paper is moved past the fixed printhead in the direction 352. The printhead 
has 6 rows of interdigitated printing elements 300, printing the six colors or types of ink 
supplied by the ink inlets 312. 

15 To protect the fragile surface of the printhead during operation, a nozzle guard 

wafer 330 is attached to the printhead substrate 301. For each nozzle 302 there is a 
corresponding nozzle guard hole 331 through which the ink droplets are fired. To prevent 
the nozzle guard holes 331 from becoming blocked by paper fibers or other debris, 
filtered air is pumped through the air inlets 332 and out of the nozzle guard holes during 

20 printing. To prevent ink 321 from drying, the nozzle guard is sealed while the printer is 
idle. 

1.6 The Netpage Pen 

The active sensing device of the netpage system is typically a pen 101, which, 
using its embedded controller 134, is able to capture and decode IR position tags from a 

25 page via an image sensor. The image sensor is a solid-state device provided with an 
appropriate filter to permit sensing at only near-infrared wavelengths. As described in 
more detail below, the system is able to sense when the nib is in contact with the surface, 
and the pen is able to sense tags at a sufficient rate to capture human handwriting (i.e. at 
200 dpi or greater and 100 Hz or faster). Information captured by the pen is encrypted 

30 and wirelessly transmitted to the printer (or base station), the printer or base station 
interpreting the data with respect to the (known) page structure. 
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The preferred embodiment of the netpage pen operates both as a normal 
marking ink pen and as a non-marking stylus. The marking aspect, however, is not 
necessary for using the netpage system as a browsing system, such as when it is used as 
an Internet interface. Each netpage pen is registered with the netpage system and has a 
5 unique pen ID 61. Figure 23 shows the netpage pen class diagram, reflecting pen-related 
information maintained by a registration server 1 1 on the netpage network. 

When either nib is in contact with a netpage, the pen determines its position 
and orientation relative to the page. The nib is attached to a force sensor, and the force 
on the nib is interpreted relative to a threshold to indicate whether the pen is "up" or 
1 0 "down". This allows a interactive element on the page to be 'clicked' by pressing with 
the pen nib, in order to request, say, information from a network. Furthermore, the force 
is captured as a continuous value to allow, say, the full dynamics of a signature to be 
verified. 

The pen determines the position and orientation of its nib on the netpage by 
15 imaging, in the infrared spectrum, an area 193 of the page in the vicinity of the nib. It 
decodes the nearest tag and computes the position of the nib relative to the tag from the 
observed perspective distortion on the imaged tag and the known geometry of the pen 
optics. Although the position resolution of the tag may be low, because the tag density 
on the page is inversely proportional to the tag size, the adjusted position resolution is 
20 quite high, exceeding the minimum resolution required for accurate handwriting 
recognition. 

Pen actions relative to a netpage are captured as a series of strokes. A stroke 
consists of a sequence of time-stamped pen positions on the page, initiated by a pen- 
down event and completed by the subsequent pen-up event. A stroke is also tagged with 
25 the page ID 50 of the netpage whenever the page ID changes, which, under normal 
circumstances, is at the commencement of the stroke. 

Each netpage pen has a current selection 826 associated with it, allowing the 
user to perform copy and paste operations etc. The selection is timestamped to allow the 
system to discard it after a defined time period. The current selection describes a region 
30 of a page instance. It consists of the most recent digital ink stroke captured through the 
pen relative to the background area of the page. It is interpreted in an application-specific 
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manner once it is submitted to an application via a selection hyperlink activation. 

Each pen has a current nib 824. This is the nib last notified by the pen to the 
system. In the case of the default netpage pen described above, either the marking black 
ink nib or the non-marking stylus nib is current. Each pen also has a current nib style 
5 825. This is the nib style last associated with the pen by an application, e.g. in response 
to the user selecting a color from a palette. The default nib style is the nib style 
associated with the current nib. Strokes captured through a pen are tagged with the 
current nib style. When the strokes are subsequently reproduced, they are reproduced in 
the nib style with which they are tagged. 

10 Whenever the pen is within range of a printer with which it can communicate, 

the pen slowly flashes its "online" LED. When the pen fails to decode a stroke relative to 
the page, it momentarily activates its "error" LED. When the pen succeeds in decoding a 
stroke relative to the page, it momentarily activates its "ok" LED. 

A sequence of captured strokes is referred to as digital ink. Digital ink forms 
1 5 the basis for the digital exchange of drawings and handwriting, for online recognition of 
handwriting, and for online verification of signatures. 

The pen is wireless and transmits digital ink to the netpage printer via a short- 
range radio link. The transmitted digital ink is encrypted for privacy and security and 
packetized for efficient transmission, but is always flushed on a pen-up event to ensure 
20 timely handling in the printer. 

When the pen is out-of-range of a printer it buffers digital ink in internal 
memory, which has a capacity of over ten minutes of continuous handwriting. When the 
pen is once again within range of a printer, it transfers any buffered digital ink. 

A pen can be registered with any number of printers, but because all state data 
25 resides in netpages both on paper and on the network, it is largely immaterial which 
printer a pen is communicating with at any particular time. 

A preferred embodiment of the pen is described in greater detail in Section 6 
below, with reference to Figures 8 to 10. 

1.7 Netpage Interaction 

30 The netpage printer 601 receives data relating to a stroke from the pen 101 
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when the pen is used to interact with a netpage 1 . The coded data 3 of the tags 4 is read 
by the pen when it is used to execute a movement, such as a stroke. The data allows the 
identity of the particular page and associated interactive element to be determined and an 
indication of the relative positioning of the pen relative to the page to be obtained. The 
5 indicating data is transmitted to the printer, where it resolves, via the DNS, the page ID 
50 of the stroke into the network address of the netpage page server 10 which maintains 
the corresponding page instance 830. It then transmits the stroke to the page server. If the 
page was recently identified in an earlier stroke, then the printer may already have the 
address of the relevant page server in its cache. Each netpage consists of a compact page 
10 layout maintained persistently by a netpage page server (see below). The page layout 
refers to objects such as images, fonts and pieces of text, typically stored elsewhere on 
the netpage network. 

When the page server receives the stroke from the pen, it retrieves the page 
description to which the stroke applies, and determines which element of the page 
1 5 description the stroke intersects. It is then able to interpret the stroke in the context of the 
type of the relevant element. 

A "click" is a stroke where the distance and time between the pen down 
position and the subsequent pen up position are both less than some small maximum. An 
object which is activated by a click typically requires a click to be activated, and 
20 accordingly, a longer stroke is ignored. The failure of a pen action, such as a "sloppy" 
click, to register is indicated by the lack of response from the pen's "ok" LED. 

There are two kinds of input elements in a netpage page description: hyperlinks 
and form fields. Input through a form field can also trigger the activation of an associated 
hyperlink. 

25 1.7.1 Hyperlinks 

A hyperlink is a means of sending a message to a remote application, and 

typically elicits a printed response in the netpage system. 

A hyperlink element 844 identifies the application 71 which handles activation 
of the hyperlink, a link ID 54 which identifies the hyperlink to the application, an "alias 
30 required" flag which asks the system to include the user's application alias ID 65 in the 
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hyperlink activation, and a description which is used when the hyperlink is recorded as a 
favorite or appears in the user's history. The hyperlink element class diagram is shown in 
Figure 29. 

When a hyperlink is activated, the page server sends a request to an application 
5 somewhere on the network. The application is identified by an application ID 64, and the 
application ID is resolved in the normal way via the DNS. There are three types of 
hyperlinks: general hyperlinks 863, form hyperlinks 865, and selection hyperlinks 864, 
as shown in Figure 30. A general hyperlink can implement a request for a linked 
document, or may simply signal a preference to a server. A form hyperlink submits the 
10 corresponding form to the application. A selection hyperlink submits the current 
selection to the application. If the current selection contains a single- word piece of text, 
for example, the application may return a single-page document giving the word's 
meaning within the context in which it appears, or a translation into a different language. 
Each hyperlink type is characterized by what information is submitted to the application. 

1 5 The corresponding hyperlink instance 862 records a transaction ID 55 which 

can be specific to the page instance on which the hyperlink instance appears. The 
transaction ID can identify user-specific data to the application, for example a "shopping 
cart" of pending purchases maintained by a purchasing application on behalf of the user. 

The system includes the pen's current selection 826 in a selection hyperlink 
20 activation. The system includes the content of the associated form instance 868 in a form 
hyperlink activation, although if the hyperlink has its "submit delta" attribute set, only 
input since the last form submission is included. The system includes an effective return 
path in all hyperlink activations. 

A hyperlinked group 866 is a group element 838 which has an associated 
25 hyperlink, as shown in Figure 31. When input occurs through any field element in the 
group, the hyperlink 844 associated with the group is activated. A hyperlinked group can 
be used to associate hyperlink behavior with a field such as a checkbox. It can also be 
used, in conjunction with the "submit delta" attribute of a form hyperlink, to provide 
continuous input to an application. It can therefore be used to support a "blackboard" 
30 interaction model, i.e. where input is captured and therefore shared as soon as it occurs. 
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1.7.2 Forms 

A form defines a collection of related input fields used to capture a related set 
of inputs through a printed netpage. A form allows a user to submit one or more 
parameters to an application software program running on a server. 

5 A form 867 is a group element 838 in the document hierarchy. It ultimately 

contains a set of terminal field elements 839. A form instance 868 represents a printed 
instance of a form. It consists of a set of field instances 870 which correspond to the field 
elements 845 of the form. Each field instance has an associated value 871, whose type 
depends on the type of the corresponding field element. Each field value records input 
1 0 through a particular printed form instance, i.e. through one or more printed netpages. The 
form class diagram is shown in Figure 32. 

Each form instance has a status 872 which indicates whether the form is active, 
frozen, submitted, void or expired. A form is active when first printed. A form becomes 
frozen once it is signed. A form becomes submitted once one of its submission 

1 5 hyperlinks has been activated, unless the hyperlink has its "submit delta" attribute set. A 
form becomes void when the user invokes a void form, reset form or duplicate form page 
command. A form expires when the time the form has been active exceeds the form's 
specified lifetime. While the form is active, form input is allowed. Input through a form 
which is not active is instead captured in the background field 833 of the relevant page 

20 instance. When the form is active or frozen, form submission is allowed. Any attempt to 
submit a form when the form is not active or frozen is rejected, and instead elicits an 
form status report. 

Each form instance is associated (at 59) with any form instances derived from 
it, thus providing a version history. This allows all but the latest version of a form in a 
25 particular time period to be excluded from a search. 

All input is captured as digital ink. Digital ink 873 consists of a set of 
timestamped stroke groups 874, each of which consists of a set of styled strokes 875. 
Each stroke consists of a set of timestamped pen positions 876, each of which also 
includes pen orientation and nib force. The digital ink class diagram is shown in Figure 
30 33. 

A field element 845 can be a checkbox field 877, a text field 878, a drawing 
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field 879, or a signature field 880. The field element class diagram is shown in Figure 
34. Any digital ink captured in a field's zone 58 is assigned to the field. 

A checkbox field has an associated boolean value 881, as shown in Figure 35. 
Any mark (a tick, a cross, a stroke, a fill zigzag, etc.) captured in a checkbox field's zone 
5 causes a true value to be assigned to the field's value. 

A text field has an associated text value 882, as shown in Figure 36. Any digital 
ink captured in a text field's zone is automatically converted to text via online 
handwriting recognition, and the text is assigned to the field's value. Online handwriting 
recognition is well-understood (see for example Tappert, C, C.Y. Suen and T. 
10 Wakahara, "The State of the Art in On-Line Handwriting Recognition", IEEE 
Transactions on Pattern Analysis and Machine Intelligence, Vol.12, No.8, August 1990). 

A signature field has an associated digital signature value 883, as shown in 
Figure 37. Any digital ink captured in a signature field's zone is automatically verified 
with respect to the identity of the owner of the pen, and a digital signature of the content 
15 of the form of which the field is part is generated and assigned to the field's value. The 
digital signature is generated using the pen user's private signature key specific to the 
application which owns the form. Online signature verification is well-understood (see 
for example Plamondon, R. and G. Lorette, "Automatic Signature Verification and 
Writer Identification - The State of the Art", Pattern Recognition, Vol.22, No.2, 1989). 

20 A field element is hidden if its "hidden" attribute is set. A hidden field element 

does not have an input zone on a page and does not accept input. It can have an 
associated field value which is included in the form data when the form containing the 
field is submitted. 

"Editing" commands, such as strike-throughs indicating deletion, can also be 
25 recognized in form fields. 

Because the handwriting recognition algorithm works "online" (i.e. with access 
to the dynamics of the pen movement), rather than "offline" (i.e. with access only to a 
bitmap of pen markings), it can recognize run-on discretely-written characters with 
relatively high accuracy, without a writer-dependent training phase. A writer-dependent 
30 model of handwriting is automatically generated over time, however, and can be 



NPT001US 




-34- 



generated up-front if necessary, 

Digital ink, as already stated, consists of a sequence of strokes. Any stroke 
which starts in a particular element's zone is appended to that element's digital ink 
stream, ready for interpretation. Any stroke not appended to an object's digital ink 
5 stream is appended to the background field's digital ink stream. 

Digital ink captured in the background field is interpreted as a selection 
gesture. Circumscription of one or more objects is generally interpreted as a selection of 
the circumscribed objects, although the actual interpretation is application-specific. 

Table 2 summarises these various pen interactions with a netpage. 

10 



Table 2. Summary of pen interactions with a netpage 



Object 


Type 


Pen input 


Action 


Hyperlink 


General 


Click 


Submit action to application 


Form 


Click 


Submit form to application 


Selection 


Click 


Submit selection to application 


Form field 


Checkbox 


Any mark 


Assign true to field 


Text 


Handwriting 


Convert digital ink to text; assign 
text to field 


Drawing 


Digital ink 


Assign digital ink to field 


Signature 


Signature 


Verify digital ink signature; 
generate digital signature of form; 
assign digital signature to field 


None 




Circumscription 


Assign digital ink to current 
selection 



The system maintains a current selection for each pen. The selection consists 
simply of the most recent stroke captured in the background field. The selection is 
1 5 cleared after an inactivity timeout to ensure predictable behavior. 

The raw digital ink captured in every field is retained on the netpage page 
server and is optionally transmitted with the form data when the form is submitted to the 
application. This allows the application to interrogate the raw digital ink should it 
suspect the original conversion, such as the conversion of handwritten text. This can, for 
20 example, involve human intervention at the application level for forms which fail certain 
application-specific consistency checks. As an extension to this, the entire background 
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area of a form can be designated as a drawing field. The application can then decide, on 
the basis of the presence of digital ink outside the explicit fields of the form, to route the 
form to a human operator, on the assumption that the user may have indicated 
amendments to the filled-in fields outside of those fields. 

5 Figure 38 shows a flowchart of the process of handling pen input relative to a 

netpage. The process consists of receiving (at 884) a stroke from the pen; identifying (at 

885) the page instance 830 to which the page ID 50 in the stroke refers; retrieving (at 

886) the page description 5; identifying (at 887) a formatted element 839 whose zone 58 
the stroke intersects; determining (at 888) whether the formatted element corresponds to 

10 a field element, and if so appending (at 892) the received stroke to the digital ink of the 
field value 871, interpreting (at 893) the accumulated digital ink of the field, and 
determining (at 894) whether the field is part of a hyperlinked group 866 and if so 
activating (at 895) the associated hyperlink; alternatively determining (at 889) whether 
the formatted element corresponds to a hyperlink element and if so activating (at 895) 

15 the corresponding hyperlink; alternatively, in the absence of an input field or hyperlink, 
appending (at 890) the received stroke to the digital ink of the background field 833; and 
copying (at 891) the received stroke to the current selection 826 of the current pen, as 
maintained by the registration server. 

Figure 38a shows a detailed flowchart of step 893 in the process shown in 
20 Figure 38, where the accumulated digital ink of a field is interpreted according to the 
type of the field. The process consists of determining (at 896) whether the field is a 
checkbox and (at 897) whether the digital ink represents a checkmark, and if so 
assigning (at 898) a true value to the field value; alternatively determining (at 899) 
whether the field is a text field and if so converting (at 900) the digital ink to computer 
25 text, with the help of the appropriate registration server, and assigning (at 901) the 
converted computer text to the field value; alternatively determining (at 902) whether the 
field is a signature field and if so verifying (at 903) the digital ink as the signature of the 
pen's owner, with the help of the appropriate registration server, creating (at 904) a 
digital signature of the contents of the corresponding form, also with the help of the 
30 registration server and using the pen owner's private signature key relating to the 
corresponding application, and assigning (at 905) the digital signature to the field value. 
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1 .7.3 Page Server Commands 

A page server command is a command which is handled locally by the page 

server. It operates directly on form, page and document instances. 

A page server command 907 can be a void form command 908, a duplicate 
5 form command 909, a reset form command 910, a get form status command 911, a 
duplicate page command 912, a reset page command 913, a get page status command 
914, a duplicate document command 915, a reset document command 916, or a get 
document status command 917, as shown in Figure 39. 

A void form command voids the corresponding form instance. A duplicate 
10 form command voids the corresponding form instance and then produces an active 
printed copy of the current form instance with field values preserved. The copy contains 
the same hyperlink transaction IDs as the original, and so is indistinguishable from the 
original to an application. A reset form command voids the corresponding form instance 
and then produces an active printed copy of the form instance with field values 
1 5 discarded. A get form status command produces a printed report on the status of the 
corresponding form instance, including who published it, when it was printed, for whom 
it was printed, and the form status of the form instance. 

Since a form hyperlink instance contains a transaction ID, the application has to 
be involved in producing a new form instance. A button requesting a new form instance 
20 is therefore typically implemented as a hyperlink. 

A duplicate page command produces a printed copy of the corresponding page 
instance with the background field value preserved. If the page contains a form or is part 
of a form, then the duplicate page command is interpreted as a duplicate form command. 
A reset page command produces a printed copy of the corresponding page instance with 
25 the background field value discarded. If the page contains a form or is part of a form, 
then the reset page command is interpreted as a reset form command. A get page status 
command produces a printed report on the status of the corresponding page instance, 
including who published it, when it was printed, for whom it was printed, and the status 
of any forms it contains or is part of. 

30 The netpage logo which appears on every netpage is usually associated with a 

duplicate page element. 
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When a page instance is duplicated with field values preserved, field values are 
printed in their native form, i.e. a checkmark appears as a standard checkmark graphic, 
and text appears as typeset text. Only drawings and signatures appear in their original 
form, with a signature accompanied by a standard graphic indicating successful signature 
5 verification. 

A duplicate document command produces a printed copy of the corresponding 
document instance with background field values preserved. If the document contains any 
forms, then the duplicate document command duplicates the forms in the same way a 
duplicate form command does. A reset document command produces a printed copy of 

10 the corresponding document instance with background field values discarded. If the 
document contains any forms, then the reset document command resets the forms in the 
same way a reset form command does. A get document status command produces a 
printed report on the status of the corresponding document instance, including who 
published it, when it was printed, for whom it was printed, and the status of any forms it 

15 contains. 

If the page server command's "on selected" attribute is set, then the command 
operates on the page identified by the pen's current selection rather than on the page 
containing the command. This allows a menu of page server commands to be printed. If 
the target page doesn't contain a page server command element for the designated page 
20 server command, then the command is ignored. 

An application can provide application-specific handling by embedding the 
relevant page server command element in a hyperlinked group. The page server activates 
the hyperlink associated with the hyperlinked group rather than executing the page server 
command. 

25 A page server command element is hidden if its "hidden" attribute is set. A 

hidden command element does not have an input zone on a page and so cannot be 
activated directly by a user. It can, however, be activated via a page server command 
embedded in a different page, if that page server command has its "on selected" attribute 
set. 
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1 .8 Standard Features of Netpages 

In the preferred form, each netpage is printed with the netpage logo at the 

bottom to indicate that it is a netpage and therefore has interactive properties. The logo 
also acts as a copy button. In most cases pressing the logo produces a copy of the page. 
5 In the case of a form, the button produces a copy of the entire form. And in the case of a 
secure document, such as a ticket or coupon, the button elicits an explanatory note or 
advertising page. 

The default single-page copy function is handled directly by the relevant 
netpage page server. Special copy functions are handled by linking the logo button to an 
1 0 application. 



15 



) User Help System 

In a preferred embodiment, the netpage printer has a single button labelled 

Help". When pressed it elicits a single page of information, including: 
status of printer connection 
status of printer consumables 
top-level help menu 
document function menu 
top-level netpage network directory 
The help menu provides a hierarchical manual on how to use the netpage 

20 system. 

The document function menu includes the following functions: 

• print a copy of a document 

• print a clean copy of a form 

• print the status of a document 

25 A document function is initiated by simply pressing the button and then 

touching any page of the document. The status of a document indicates who published it 
and when, to whom it was delivered, and to whom and when it was subsequently 
submitted as a form. 
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The netpage network directory allows the user to navigate the hierarchy of 
publications and services on the network. As an alternative, the user can call the netpage 
network "900" number "y e ^ ow pages" and speak to a human operator. The operator can 
locate the desired document and route it to the user's printer. Depending on the 
5 document type, the publisher or the user pays the small "yellow pages" service fee. 

The help page is obviously unavailable if the printer is unable to print. In this 
case the "error" light is lit and the user can request remote diagnosis over the network. 

2 Personalized Publication Model 

In the following description, news is used as a canonical publication example 

10 to illustrate personalization mechanisms in the netpage system. Although news is often 
used in the limited sense of newspaper and newsmagazine news, the intended scope in 
the present context is wider. 

In the netpage system, the editorial content and the advertising content of a 
news publication are personalized using different mechanisms. The editorial content is 
15 personalized according to the reader's explicitly stated and implicitly captured interest 
profile. The advertising content is personalized according to the reader's locality and 
demographic. 

2.1 Editorial Personalization 

A subscriber can draw on two kinds of news sources: those that deliver news 

20 publications, and those that deliver news streams. While news publications are 
aggregated and edited by the publisher, news streams are aggregated either by a news 
publisher or by a specialized news aggregator. News publications typically correspond to 
traditional newspapers and newsmagazines, while news streams can be many and varied: 
a "raw" news feed from a news service, a cartoon strip, a freelance writer's column, a 

25 friend's bulletin board, or the reader's own e-mail. 

The netpage publication server supports the publication of edited news 
publications as well as the aggregation of multiple news streams. By handling the 
aggregation and hence the formatting of news streams selected directly by the reader, the 
server is able to place advertising on pages over which it otherwise has no editorial 
30 control. 
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The subscriber builds a daily newspaper by selecting one or more contributing 
news publications, and creating a personalized version of each. The resulting daily 
editions are printed and bound together into a single newspaper. The various members of 
a household typically express their different interests and tastes by selecting different 
5 daily publications and then customizing them. 

For each publication, the reader optionally selects specific sections. Some 
sections appear daily, while others appear weekly. The daily sections available from The 
New York Times online, for example, include "Page One Plus", "National", 
"International", "Opinion", "Business", "Arts/Living", "Technology", and "Sports". The 
1 0 set of available sections is specific to a publication, as is the default subset. 

The reader can extend the daily newspaper by creating custom sections, each 
one drawing on any number of news streams. Custom sections might be created for e- 
mail and friends' announcements ("Personal"), or for monitoring news feeds for specific 
topics ("Alerts" or "Clippings"). 

15 For each section, the reader optionally specifies its size, either qualitatively 

(e.g. short, medium, or long), or numerically (i.e. as a limit on its number of pages), and 
the desired proportion of advertising, either qualitatively (e.g. high, normal, low, none), 
or numerically (i.e. as a percentage). 

The reader also optionally expresses a preference for a large number of shorter 
20 articles or a small number of longer articles. Each article is ideally written (or edited) in 
both short and long forms to support this preference. 

An article may also be written (or edited) in different versions to match the 
expected sophistication of the reader, for example to provide children's and adults' 
versions. The appropriate version is selected according to the reader's age. The reader 
25 can specify a "reading age" which takes precedence over their biological age. 

The articles which make up each section are selected and prioritized by the 
editors, and each is assigned a useful lifetime. By default they are delivered to all 
relevant subscribers, in priority order, subject to space constraints in the subscribers' 
editions. 

30 In sections where it is appropriate, the reader may optionally enable 
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collaborative filtering. This is then applied to articles which have a sufficiently long 
lifetime. Each article which qualifies for collaborative filtering is printed with rating 
buttons at the end of the article. The buttons can provide an easy choice (e.g. "liked" and 
"disliked'), making it more likely that readers will bother to rate the article. 

5 Articles with high priorities and short lifetimes are therefore effectively 

considered essential reading by the editors and are delivered to most relevant 
subscribers. 

The reader optionally specifies a serendipity factor, either qualitatively (e.g. do 
or don't surprise me), or numerically. A high serendipity factor lowers the threshold used 
1 0 for matching during collaborative filtering. A high factor makes it more likely that the 
corresponding section will be filled to the reader's specified capacity. A different 
serendipity factor can be specified for different days of the week. 

The reader also optionally specifies topics of particular interest within a 
section, and this modifies the priorities assigned by the editors. 

15 The speed of the reader's Internet connection affects the quality at which 

images can be delivered. The reader optionally specifies a preference for fewer images or 
smaller images or both. If the number or size of images is not reduced, then images may 
be delivered at lower quality (i.e. at lower resolution or with greater compression). 

At a global level, the reader specifies how quantities, dates, times and monetary 
20 values are localized. This involves specifying whether units are imperial or metric, a 
local timezone and time format, and a local currency, and whether the localization 
consist of in situ translation or annotation. These preferences are derived from the 
reader's locality by default. 

To reduce reading difficulties caused by poor eyesight, the reader optionally 
25 specifies a global preference for a larger presentation. Both text and images are scaled 
accordingly, and less information is accommodated on each page. 

The language in which a news publication is published, and its corresponding 
text encoding, is a property of the publication and not a preference expressed by the user. 
However, the netpage system can be configured to provide automatic translation services 
30 in various guises. 
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2.2 Advertising Localization and Targeting 

The personalization of the editorial content directly affects the advertising 

content, because advertising is typically placed to exploit the editorial context. Travel 
ads, for example, are more likely to appear in a travel section than elsewhere. The value 
5 of the editorial content to an advertiser (and therefore to the publisher) lies in its ability 
to attract large numbers of readers with the right demographics. 

Effective advertising is placed on the basis of locality and demographics. 
Locality determines proximity to particular services, retailers etc., and particular interests 
and concerns associated with the local community and environment. Demographics 
1 0 determine general interests and preoccupations as well as likely spending patterns. 

A news publisher's most profitable product is advertising "space", a multi- 
dimensional entity determined by the publication's geographic coverage, the size of its 
readership, its readership demographics, and the page area available for advertising. 

In the netpage system, the netpage publication server computes the 
15 approximate multi-dimensional size of a publication's saleable advertising space on a 
per-section basis, taking into account the publication's geographic coverage, the 
section's readership, the size of each reader's section edition, each reader's advertising 
proportion, and each reader's demographic. 

In comparison with other media, the netpage system allows the advertising 
20 space to be defined in greater detail, and allows smaller pieces of it to be sold separately. 
It therefore allows it to be sold at closer to its true value. 

For example, the same advertising "slot" can be sold in varying proportions to 
several advertisers, with individual readers' pages randomly receiving the advertisement 
of one advertiser or another, overall preserving the proportion of space sold to each 
25 advertiser. 

The netpage system allows advertising to be linked directly to detailed product 
information and online purchasing. It therefore raises the intrinsic value of the 
advertising space. 

Because personalization and localization are handled automatically by netpage 
30 publication servers, an advertising aggregator can provide arbitrarily broad coverage of 
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both geography and demographics. The subsequent disaggregation is efficient because it 
is automatic. This makes it more cost-effective for publishers to deal with advertising 
aggregators than to directly capture advertising. Even though the advertising aggregator 
is taking a proportion of advertising revenue, publishers may find the change profit- 
5 neutral because of the greater efficiency of aggregation. The advertising aggregator acts 
as an intermediary between advertisers and publishers, and may place the same 
advertisement in multiple publications. 

It is worth noting that ad placement in a netpage publication can be more 
complex than ad placement in the publication's traditional counterpart, because the 

1 0 publication's advertising space is more complex. While ignoring the full complexities of 
negotiations between advertisers, advertising aggregators and publishers, the preferred 
form of the netpage system provides some automated support for these negotiations, 
including support for automated auctions of advertising space. Automation is particularly 
desirable for the placement of advertisements which generate small amounts of income, 

1 5 such as small or highly localized advertisements. 

Once placement has been negotiated, the aggregator captures and edits the 
advertisement and records it on a netpage ad server. Correspondingly, the publisher 
records the ad placement on the relevant netpage publication server. When the netpage 
publication server lays out each user's personalized publication, it picks the relevant 
20 advertisements from the netpage ad server. 

2.3 User Profiles 

2.3.1 Information Filtering 

The personalization of news and other publications relies on an assortment of 

user-specific profile information, including: 
25 ♦ publication customizations 

• collaborative filtering vectors 

• contact details 

• presentation preferences 

The customization of a publication is typically publication-specific, and so the 
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customization information is maintained by the relevant netpage publication server. 

A collaborative filtering vector consists of the user's ratings of a number of 
news items. It is used to correlate different users' interests for the purposes of making 
recommendations. Although there are benefits to maintaining a single collaborative 
5 filtering vector independently of any particular publication, there are two reasons why it 
is more practical to maintain a separate vector for each publication: there is likely to be 
more overlap between the vectors of subscribers to the same publication than between 
those of subscribers to different publications; and a publication is likely to want to 
present its users' collaborative filtering vectors as part of the value of its brand, not to be 
10 found elsewhere. Collaborative filtering vectors are therefore also maintained by the 
relevant netpage publication server. 

Contact details, including name, street address, ZIP Code, state, country, 
telephone numbers, are global by nature, and are maintained by a netpage registration 
server. 

15 Presentation preferences, including those for quantities, dates and times, are 

likewise global and maintained in the same way. 

The localization of advertising relies on the locality indicated in the user's 
contact details, while the targeting of advertising relies on personal information such as 
date of birth, gender, marital status, income, profession, education, or qualitative 
20 derivatives such as age range and income range. 

For those users who choose to reveal personal information for advertising 
purposes, the information is maintained by the relevant netpage registration server. In the 
absence of such information, advertising can be targeted on the basis of the demographic 
associated with the user's ZIP or ZIP+4 Code. 

25 Each user, pen, printer, application provider and application is assigned its own 

unique identifier, and the netpage registration server maintains the relationships between 
them, as shown in Figures 21, 22, 23 and 24. For registration purposes, a publisher is a 
special kind of application provider, and a publication is a special kind of application. 

Each user 800 may be authorized to use any number of printers 802, and each 
30 printer may allow any number of users to use it. Each user has a single default printer (at 



NPT001US 



-45- 

66), to which periodical publications are delivered by default, whilst pages printed on 
demand are delivered to the printer through which the user is interacting. The server 
keeps track of which publishers a user has authorized to print to the user's default 
printer. A publisher does not record the ID of any particular printer, but instead resolves 
5 the ID when it is required. 

When a user subscribes 808 to a publication 807, the publisher 806 (i.e. 
application provider 803) is authorized to print to a specified printer or the user's default 
printer. This authorization can be revoked at any time by the user. Each user may have 
several pens 801, but a pen is specific to a single user. If a user is authorized to use a 
1 0 particular printer, then that printer recognizes any of the user's pens. 

The pen ID is used to locate the corresponding user profile maintained by a 
particular netpage registration server, via the DNS in the usual way. 

A Web terminal 809 can be authorized to print on a particular netpage printer, 
allowing Web pages and netpage documents encountered during Web browsing to be 
1 5 conveniently printed on the nearest netpage printer. 

The netpage system can collect, on behalf of a printer provider, fees and 
commissions on income earned through publications printed on the provider's printers. 
Such income can include advertising fees, click-through fees, e-commerce commissions, 
and transaction fees. If the printer is owned by the user, then the user is the printer 
20 provider. 

Each user also has a netpage account 820 which is used to accumulate micro- 
debits and credits (such as those described in the preceding paragraph); contact details 
815, including name, address and telephone numbers; global preferences 816, including 
privacy, delivery and localization settings; any number of biometric records 817, 
25 containing the user's encoded signature 818, fingerprint 819 etc; a handwriting model 
819 automatically maintained by the system; and SET payment card accounts 821 with 
which e-commerce payments can be made. 

2.3.2 Favorites List 

A netpage user can maintain a list 922 of "favorites" - links to useful 

30 documents etc. on the netpage network. The list is maintained by the system on the 
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user's behalf. It is organized as a hierarchy of folders 924, a preferrred embodiment of 
which is shown in the class diagram in Figure 41 . 

2.3.3 History List 

The system maintains a history list 929 on each user's behalf, containing links 

5 to documents etc. accessed by the user through the netpage system. It is organized as a 
date-ordered list, a preferred embodiment of which is shown in the class diagram in 
Figure 42. 

2.4 Intelligent Page Layout 

The netpage publication server automatically lays out the pages of each user's 

1 0 personalized publication on a section-by-section basis. Since most advertisements are in 
the form of pre-formatted rectangles, they are placed on the page before the editorial 
content. 

The advertising ratio for a section can be achieved with wildly varying 
advertising ratios on individual pages within the section, and the ad layout algorithm 
1 5 exploits this. The algorithm is configured to attempt to co-locate closely tied editorial 
and advertising content, such as placing ads for roofing material specifically within the 
publication because of a special feature on do-it-yourself roofing repairs. 

The editorial content selected for the user, including text and associated images 
and graphics, is then laid out according to various aesthetic rules. 

20 The entire process, including the selection of ads and the selection of editorial 

content, must be iterated once the layout has converged, to attempt to more closely 
achieve the user's stated section size preference. The section size preference can, 
however, be matched on average over time, allowing significant day-to-day variations. 

2.5 Document Format 

25 Once the document is laid out, it is encoded for efficient distribution and 

persistent storage on the netpage network. 

The primary efficiency mechanism is the separation of information specific to a 
single user's edition and information shared between multiple users' editions. The 
specific information consists of the page layout. The shared information consists of the 
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objects to which the page layout refers, including images, graphics, and pieces of text. 

A text object contains fully-formatted text represented in the Extensible 
Markup Language (XML) using the Extensible Stylesheet Language (XSL). XSL 
provides precise control over text formatting independently of the region into which the 
5 text is being set, which in this case is being provided by the layout. The text object 
contains embedded language codes to enable automatic translation, and embedded 
hyphenation hints to aid with paragraph formatting. 

An image object encodes an image in the JPEG 2000 wavelet-based 
compressed image format. A graphic object encodes a 2D graphic in Scalable Vector 
1 0 Graphics (SVG) format. 

The layout itself consists of a series of placed image and graphic objects, linked 
textflow objects through which text objects flow, hyperlinks and input fields as 
described above, and watermark regions. These layout objects are summarized in Table 
3. The layout uses a compact format suitable for efficient distribution and storage. 

15 



Table 3. Netpage layout objects 



Layout 
object 


Attribute 


Format of 
linked object 


Image 


Position 




Image object ID 


JPEG 2000 


Graphic 


Position 




Graphic object ID 


SVG 


Textflow 


Textflow ID 




Zone 




Optional text object ID 


XML/XSL 


Hyperlink 


Type 




Zone 




Application ID, etc. 




Field 


Type 




Meaning 




Zone 




Watermark 


Zone 





2.6 Document Distribution 

As described above, for purposes of efficient distribution and persistent storage 
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on the netpage network, a user-specific page layout is separated from the shared objects 
to which it refers. 

When a subscribed publication is ready to be distributed, the netpage 
publication server allocates, with the help of the netpage ID server 12, a unique ID for 
5 each page, page instance, document, and document instance. 

The server computes a set of optimized subsets of the shared content and 
creates a multicast channel for each subset, and then tags each user-specific layout with 
the names of the multicast channels which will carry the shared content used by that 
layout. The server then pointcasts each user's layouts to that user's printer via the 

10 appropriate page server, and when the pointcasting is complete, multicasts the shared 
content on the specified channels. After receiving its pointcast, each page server and 
printer subscribes to the multicast channels specified in the page layouts. During the 
multicasts, each page server and printer extracts from the multicast streams those objects 
referred to by its page layouts. The page servers persistently archive the received page 

1 5 layouts and shared content. 

Once a printer has received all the objects to which its page layouts refer, the 
printer re-creates the fully-populated layout and then rasterizes and prints it. 

Under normal circumstances, the printer prints pages faster than they can be 
delivered. Assuming a quarter of each page is covered with images, the average page has 
20 a size of less than 400KB. The printer can therefore hold in excess of 100 such pages in 
its internal 64MB memory, allowing for temporary buffers etc. The printer prints at a 
rate of one page per second. This is equivalent to 400KB or about 3Mbit of page data per 
second, which is similar to the highest expected rate of page data delivery over a 
broadband network. 

25 Even under abnormal circumstances, such as when the printer runs out of 

paper, it is likely that the user will be able to replenish the paper supply before the 
printer's 100-page internal storage capacity is exhausted. 

However, if the printer's internal memory does fill up, then the printer will be 
unable to make use of a multicast when it first occurs. The netpage publication server 
30 therefore allows printers to submit requests for re-multicasts. When a critical number of 
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requests is received or a timeout occurs, the server re-multicasts the corresponding 
shared objects. 

Once a document is printed, a printer can produce an exact duplicate at any 
time by retrieving its page layouts and contents from the relevant page server. 

5 2.7 On-Demand Documents 

When a netpage document is requested on demand, it can be personalized and 

delivered in much the same way as a periodical. However, since there is no shared 
content, delivery is made directly to the requesting printer without the use of multicast. 

When a non-netpage document is requested on demand, it is not personalized, 
1 0 and it is delivered via a designated netpage formatting server which reformats it as a 
netpage document. A netpage formatting server is a special instance of a netpage 
publication server. The netpage formatting server has knowledge of various Internet 
document formats, including Adobe's Portable Document Format (PDF), and Hypertext 
Markup Language (HTML). In the case of HTML, it can make use of the higher 
1 5 resolution of the printed page to present Web pages in a multi-column format, with a 
table of contents. It can automatically include all Web pages directly linked to the 
requested page. The user can tune this behavior via a preference. 

The netpage formatting server makes standard netpage behavior, including 
interactivity and persistence, available on any Internet document, no matter what its 
20 origin and format. It hides knowledge of different document formats from both the 
netpage printer and the netpage page server, and hides knowledge of the netpage system 
from Web servers. 

3 Security 

3.1 Cryptography 

25 Cryptography is used to protect sensitive information, both in storage and in 

transit, and to authenticate parties to a transaction. There are two classes of cryptography 
in widespread use: secret-key cryptography and public-key cryptography. The netpage 
network uses both classes of cryptography. 

Secret-key cryptography, also referred to as symmetric cryptography, uses the 
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same key to encrypt and decrypt a message. Two parties wishing to exchange messages 
must first arrange to securely exchange the secret key. 

Public-key cryptography, also referred to as asymmetric cryptography, uses two 
encryption keys. The two keys are mathematically related in such a way that any message 
5 encrypted using one key can only be decrypted using the other key. One of these keys is 
then published, while the other is kept private. The public key is used to encrypt any 
message intended for the holder of the private key. Once encrypted using the public key, 
a message can only be decrypted using the private key. Thus two parties can securely 
exchange messages without first having to exchange a secret key. To ensure that the 
1 0 private key is secure, it is normal for the holder of the private key to generate the key 
pair. 

Public-key cryptography can be used to create a digital signature. The holder of 
the private key can create a known hash of a message and then encrypt the hash using the 
private key. Anyone can then verify that the encrypted hash constitutes the "signature" of 
15 the holder of the private key with respect to that particular message by decrypting the 
encrypted hash using the public key and verifying the hash against the message. If the 
signature is appended to the message, then the recipient of the message can verify both 
that the message is genuine and that it has not been altered in transit. 

To make public-key cryptography work, there has to be a way to distribute 
20 public keys which prevents impersonation. This is normally done using certificates and 
certificate authorities. A certificate authority is a trusted third party which authenticates 
the connection between a public key and someone's identity. The certificate authority 
verifies the person's identity by examining identity documents, and then creates and 
signs a digital certificate containing the person's identity details and public key. Anyone 
25 who trusts the certificate authority can use the public key in the certificate with a high 
degree of certainty that it is genuine. They just have to verify that the certificate has 
indeed been signed by the certificate authority, whose public key is well-known. 

In most transaction environments, public-key cryptography is only used to 
create digital signatures and to securely exchange secret session keys. Secret-key 
30 cryptography is used for all other purposes. 

In the following discussion, when reference is made to the secure transmission 
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of information between a netpage printer and a server, what actually happens is that the 
printer obtains the server's certificate, authenticates it with reference to the certificate 
authority, uses the public key-exchange key in the certificate to exchange a secret session 
key with the server, and then uses the secret session key to encrypt the message data. A 
5 session key, by definition, can have an arbitrarily short lifetime. 

3.2 Netpage Printer Security 

Each netpage printer is assigned a pair of unique identifiers at time of 

manufacture which are stored in read-only memory in the printer and in the netpage 
registration server database. The first ID 62 is public and uniquely identifies the printer 
10 on the netpage network. The second ID is secret and is used when the printer is first 
registered on the network. 

When the printer connects to the netpage network for the first time after 
installation, it creates a signature public/private key pair. It transmits the secret ID and 
the public key securely to the netpage registration server. The server compares the secret 
15 ID against the printer's secret ID recorded in its database, and accepts the registration if 
the IDs match. It then creates and signs a certificate containing the printer's public ID 
and public signature key, and stores the certificate in the registration database. 

The netpage registration server acts as a certificate authority for netpage 
printers, since it has access to secret information allowing it to verify printer identity. 

20 When a user subscribes to a publication, a record is created in the netpage 

registration server database authorizing the publisher to print the publication to the user's 
default printer or a specified printer. Every document sent to a printer via a page server is 
addressed to a particular user and is signed by the publisher using the publisher's private 
signature key. The page server verifies, via the registration database, that the publisher is 

25 authorized to deliver the publication to the specified user. The page server verifies the 
signature using the publisher's public key, obtained from the publisher's certificate 
stored in the registration database. 

The netpage registration server accepts requests to add printing authorizations 
to the database, so long as those requests are initiated via a pen registered to the printer. 
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3.3 Netpage Pen Security 

Each netpage pen is assigned a unique identifier at time of manufacture which 

is stored in read-only memory in the pen and in the netpage registration server database. 
The pen ID 61 uniquely identifies the pen on the netpage network. 

5 A netpage pen can "know" a number of netpage printers, and a printer can 

"know" a number of pens. A pen communicates with a printer via a radio frequency 
signal whenever it is within range of the printer. Once a pen and printer are registered, 
they regularly exchange session keys. Whenever the pen transmits digital ink to the 
printer, the digital ink is always encrypted using the appropriate session key. Digital ink 
10 is never transmitted in the clear. 

A pen stores a session key for every printer it knows, indexed by printer ID, 
and a printer stores a session key for every pen it knows, indexed by pen ID. Both have a 
large but finite storage capacity for session keys, and will forget a session key on a least- 
recently-used basis if necessary. 

15 When a pen comes within range of a printer, the pen and printer discover 

whether they know each other. If they don't know each other, then the printer determines 
whether it is supposed to know the pen. This might be, for example, because the pen 
belongs to a user who is registered to use the printer. If the printer is meant to know the 
pen but doesn't, then it initiates the automatic pen registration procedure. If the printer 

20 isn't meant to know the pen, then it agrees with the pen to ignore it until the pen is 
placed in a charging cup, at which time it initiates the registration procedure. 

In addition to its public ID, the pen contains a secret key-exchange key. The 
key-exchange key is also recorded in the netpage registration server database at time of 
manufacture. During registration, the pen transmits its pen ID to the printer, and the 

25 printer transmits the pen ID to the netpage registration server. The server generates a 
session key for the printer and pen to use, and securely transmits the session key to the 
printer. It also transmits a copy of the session key encrypted with the pen's key-exchange 
key. The printer stores the session key internally, indexed by the pen ID, and transmits 
the encrypted session key to the pen. The pen stores the session key internally, indexed 

30 by the printer ID. 

Although a fake pen can impersonate a pen in the pen registration protocol, 
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only a real pen can decrypt the session key transmitted by the printer. 

When a previously unregistered pen is first registered, it is of limited use until 
it is linked to a user. A registered but "un-owned" pen is only allowed to be used to 
request and fill in netpage user and pen registration forms, to register a new user to 
5 which the new pen is automatically linked, or to add a new pen to an existing user. 

The pen uses secret-key rather than public-key encryption because of hardware 
performance constraints in the pen. 

3.4 Secure Documents 

The netpage system supports the delivery of secure documents such as tickets 

1 0 and coupons. The netpage printer includes a facility to print watermarks, but will only do 
so on request from publishers who are suitably authorized. The publisher indicates its 
authority to print watermarks in its certificate, which the printer is able to authenticate. 

The "watermark" printing process uses an alternative dither matrix in specified 
"watermark" regions of the page. Back-to-back pages contain mirror-image watermark 
1 5 regions which coincide when printed. The dither matrices used in odd and even pages' 
watermark regions are designed to produce an interference effect when the regions are 
viewed together, achieved by looking through the printed sheet. 

The effect is similar to a watermark in that it is not visible when looking at only 
one side of the page, and is lost when the page is copied by normal means. 

20 Pages of secure documents cannot be copied using the built-in netpage copy 

mechanism described in Section 1.9 above. This extends to copying netpages on 
netpage-aware photocopiers. 

Secure documents are typically generated as part of e-commerce transactions. 
They can therefore include the user's photograph which was captured when the user 
25 registered biometric information with the netpage registration server, as described in 
Section 2. 

When presented with a secure netpage document, the recipient can verify its 
authenticity by requesting its status in the usual way. The unique ID of a secure 
document is only valid for the lifetime of the document, and secure document IDs are 
30 allocated non-contiguously to prevent their prediction by opportunistic forgers. A secure 
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document verification pen can be developed with built-in feedback on verification 
failure, to support easy point-of-presentation document verification. 

Clearly neither the watermark nor the user's photograph are secure in a 
cryptographic sense. They simply provide a significant obstacle to casual forgery. Online 
5 document verification, particularly using a verification pen, provides an added level of 
security where it is needed, but is still not entirely immune to forgeries. 

3.5 Non-Repudiation 

In the netpage system, forms submitted by users are delivered reliably to forms 

handlers and are persistently archived on netpage page servers. It is therefore impossible 
1 0 for recipients to repudiate delivery. 

E-commerce payments made through the system, as described in Section 4, are 
also impossible for the payee to repudiate. 

4 Electronic Commerce Model 

4.1 Secure Electronic Transaction (SET) 

15 The netpage system uses the Secure Electronic Transaction (SET) system as 

one of its payment systems. SET, having been developed by MasterCard and Visa, is 
organized around payment cards, and this is reflected in the terminology. However, 
much of the system is independent of the type of accounts being used. 

In SET, cardholders and merchants register with a certificate authority and are 
20 issued with certificates containing their public signature keys. The certificate authority 
verifies a cardholder's registration details with the card issuer as appropriate, and verifies 
a merchant's registration details with the acquirer as appropriate. Cardholders and 
merchants store their respective private signature keys securely on their computers. 
During the payment process, these certificates are used to mutually authenticate a 
25 merchant and cardholder, and to authenticate them both to the payment gateway. 

SET has not yet been adopted widely, partly because cardholder maintenance of 
keys and certificates is considered burdensome. Interim solutions which maintain 
cardholder keys and certificates on a server and give the cardholder access via a 
password have met with some success. 
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4.2 SET Payments 

In the netpage system the netpage registration server acts as a proxy for the 

netpage user (i.e. the cardholder) in SET payment transactions. 

The netpage system uses biometrics to authenticate the user and authorize SET 
5 payments. Because the system is. pen-based, the biometric used is the user's on-line 
signature, consisting of time-varying pen position and pressure. A fingerprint biometric 
can also be used by designing a fingerprint sensor into the pen, although at a higher cost. 
The type of biometric used only affects the capture of the biometric, not the 
authorization aspects of the system. 

10 The first step to being able to make SET payments is to register the user's 

biometric with the netpage registration server. This is done in a controlled environment, 
for example a bank, where the biometric can be captured at the same time as the user's 
identity is verified. The biometric is captured and stored in the registration database, 
linked to the user's record. The user's photograph is also optionally captured and linked 

15 to the record. The SET cardholder registration process is completed, and the resulting 
private signature key and certificate are stored in the database. The user's payment card 
information is also stored, giving the netpage registration server enough information to 
act as the user's proxy in any SET payment transaction. 

When the user eventually supplies the biometric to complete a payment, for 
20 example by signing a netpage order form, the printer securely transmits the order 
information, the pen ID and the biometric data to the netpage registration server. The 
server verifies the biometric with respect to the user identified by the pen ID, and from 
then on acts as the user's proxy in completing the SET payment transaction. 

4.3 Micro-Payments 

25 The netpage system includes a mechanism for micro-payments, to allow the 

user to be conveniently charged for printing low-cost documents on demand and for 
copying copyright documents, and possibly also to allow the user to be reimbursed for 
expenses incurred in printing advertising material. The latter depends on the level of 
subsidy already provided to the user. 

30 When the user registers for e-commerce, a network account is established 

which aggregates micro-payments. The user receives a statement on a regular basis, and 
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can settle any outstanding debit balance using the standard payment mechanism. 

The network account can be extended to aggregate subscription fees for 
periodicals, which would also otherwise be presented to the user in the form of 
individual statements. 

5 4.4 Transactions 

When a user requests a netpage in a particular application context, the 

application is able to embed a user- specific transaction ID 55 in the page. Subsequent 
input through the page is tagged with the transaction ID, and the application is thereby 
able to establish an appropriate context for the user's input. 

10 When input occurs through a page which is not user-specific, however, the 

application must use the user's unique identity to establish a context. A typical example 
involves adding items from a pre-printed catalog page to the user's virtual "shopping 
cart". To protect the user's privacy, however, the unique user ID 60 known to the 
netpage system is not divulged to applications. This is to prevent different application 

1 5 providers from easily correlating independently accumulated behavioral data. 

The netpage registration server instead maintains an anonymous relationship 
between a user and an application via a unique alias ID 65, as shown in Figure 24. 
Whenever the user activates a hyperlink tagged with the "registered" attribute, the 
netpage page server asks the netpage registration server to translate the associated 
20 application ID 64, together with the pen ID 61, into an alias ID 65. The alias ID is then 
submitted to the hyperlink's application. 

The application maintains state information indexed by alias ID, and is able to 
retrieve user-specific state information without knowledge of the global identity of the 
user. 

25 The system also maintains an independent certificate and private signature key 

for each of a user's applications, to allow it to sign application transactions on behalf of 
the user using only application-specific information. 

To assist the system in routing product bar code (UPC) "hyperlink" activations, 
the system records a favorite application on behalf of the user for any number of product 
30 types. 
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Each application is associated with an application provider, and the system 
maintains an account on behalf of each application provider, to allow it to credit and 
debit the provider for click-through fees etc. 

An application provider can be a publisher of periodical subscribed content. 
5 The system records the user's willingness to receive the subscribed publication, as well 
as the expected frequency of publication. 

4.5 Resource Descriptions and Copyright 

A preferred embodiment of a resource description class diagram is shown in 

Figure 40. 

1 0 Each document and content object may be described by one or more resource 

descriptions 842. Resource descriptions use the Dublin Core metadata element set, 
which is designed to facilitate discovery of electronic resources. Dublin Core metadata 
conforms to the World Wide Web Consortium (W3C) Resource Description Framework 
(RDF). 

15 A resource description may identify rights holders 920. The netpage system 

automatically transfers copyright fees from users to rights holders when users print 
copyright content. 

5 Communications Protocols 

A communications protocol defines an ordered exchange of messages between 

20 entities. In the netpage system, entities such as pens, printers and servers utilise a set of 
defined protocols to cooperatively handle user interaction with the netpage system. 

Each protocol is illustrated by way of a sequence diagram in which the 
horizontal dimension is used to represent message flow and the vertical dimension is 
used to represent time. Each entity is represented by a rectangle containing the name of 
25 the entity and a vertical column representing the lifeline of the entity. During the time an 
entity exists, the lifeline is shown as a dashed line. During the time an entity is active, 
the lifeline is shown as a double line. Because the protocols considered here do not 
create or destroy entities, lifelines are generally cut short as soon as an entity ceases to 
participate in a protocol. 
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5.1 Subscription Delivery Protocol 

A preferred embodiment of a subscription delivery protocol is shown in Figure 

43. 

A large number of users may subscribe to a periodical publication. Each user's 
5 edition may be laid out differently, but many users' editions will share common content 
such as text objects and image objects. The subscription delivery protocol therefore 
delivers document structures to individual printers via pointcast, but delivers shared 
content objects via multicast. 

The application (i.e. publisher) first obtains a document ID 51 for each 
10 document from an ID server 12. It then sends each document structure, including its 
document ID and page descriptions, to the page server 10 responsible for the document's 
newly allocated ID. It includes its own application ID 64, the subscriber's alias ID 65, 
and the relevant set of multicast channel names. It signs the message using its private 
signature key. 

15 The page server uses the application ID and alias ID to obtain from the 

registration server the corresponding user ID 60, the user's selected printer ID 62 (which 
may be explicitly selected for the application, or may be the user's default printer), and 
the application's certificate. 

The application's certificate allows the page server to verify the message 
20 signature. The page server's request to the registration server fails if the application ID 
and alias ID don't together identify a subscription 808. 

The page server then allocates document and page instance IDs and forwards 
the page descriptions, including page IDs 50, to the printer. It includes the relevant set of 
multicast channel names for the printer to listen to. 

25 It then returns the newly allocated page IDs to the application for future 

reference. 

Once the application has distributed all of the document structures to the 
subscribers' selected printers via the relevant page servers, it multicasts the various 
subsets of the shared objects on the previously selected multicast channels. Both page 
30 servers and printers monitor the appropriate multicast channels and receive their required 
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content objects. They are then able to populate the previously pointcast document 
structures. This allows the page servers to add complete documents to their databases, 
and it allows the printers to print the documents. 

5.2 Hyperlink Activation Protocol 

5 A preferred embodiment of a hyperlink activation protocol is shown in Figure 

45. 

When a user clicks on a netpage with a netpage pen, the pen communicates the 
click to the nearest netpage printer 601. The click identifies the page and a location on 
the page. The printer already knows the ID 61 of the pen from the pen connection 
10 protocol. 

The printer determines, via the DNS, the network address of the page server 
10a handling the particular page ID 50. The address may already be in its cache if the 
user has recently interacted with the same page. The printer then forwards the pen ID, its 
own printer ID 62, the page ID and click location to the page server. 

15 The page server loads the page description 5 identified by the page ID and 

determines which input element's zone 58, if any, the click lies in. Assuming the 
relevant input element is a hyperlink element 844, the page server then obtains the 
associated application ID 64 and link ID 54, and determines, via the DNS, the network 
address of the application server hosting the application 71. 

20 The page server uses the pen ID 61 to obtain the corresponding user ID 60 from 

the registration server 11, and then allocates a globally unique hyperlink request ID 52 
and builds a hyperlink request 934. The hyperlink request class diagram is shown in 
Figure 44. The hyperlink request records the IDs of the requesting user and printer, and 
identifies the clicked hyperlink instance 862. The page server then sends its own server 

25 ID 53, the hyperlink request ID, and the link ID to the application. 

The application produces a response document according to application- 
specific logic, and obtains a document ID 51 from an ID server 12. It then sends the 
document to the page server 10b responsible for the document's newly allocated ID, 
together with the requesting page server's ID and the hyperlink request ID. 

30 The second page server sends the hyperlink request ID and application ID to 
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the first page server to obtain the corresponding user ID and printer ID 62. The first page 
server rejects the request if the hyperlink request has expired or is for a different 
application. 

The second page server allocates document instance and page IDs 50, returns 
5 the newly allocated page IDs to the application, adds the complete document to its own 
database, and finally sends the page descriptions to the requesting printer. 

The hyperlink instance may include a meaningful transaction ID 55, in which 
case the first page server includes the transaction ID in the message sent to the 
application. This allows the application to establish a transaction-specific context for the 
1 0 hyperlink activation. 

If the hyperlink requires a user alias, i.e. its "alias required" attribute is set, then 
the first page server sends both the pen ID 61 and the hyperlink's application ID 64 to 
the registration server 1 1 to obtain not just the user ID corresponding to the pen ID but 
also the alias ID 65 corresponding to the application ID and the user ID. It includes the 
1 5 alias ID in the message sent to the application, allowing the application to establish a 
user-specific context for the hyperlink activation. 

5.3 Handwriting Recognition Protocol 

When a user draws a stroke on a netpage with a netpage pen, the pen 

communicates the stroke to the nearest netpage printer. The stroke identifies the page 
20 and a path on the page. 

The printer forwards the pen ID 61, its own printer ID 62, the page ID 50 and 
stroke path to the page server 10 in the usual way. 

The page server loads the page description 5 identified by the page ID and 
determines which input element's zone 58, if any, the stroke intersects. Assuming the 
25 relevant input element is a text field 878, the page server appends the stroke to the text 
field's digital ink. 

After a period of inactivity in the zone of the text field, the page server sends 
the pen ID and the pending strokes to the registration server 1 1 for interpretation. The 
registration server identifies the user corresponding to the pen, and uses the user's 
30 accumulated handwriting model 822 to interpret the strokes as handwritten text. Once it 
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has converted the strokes to text, the registration server returns the text to the requesting 
page server. The page server appends the text to the text value of the text field. 

5.4 Signature Verification Protocol 

Assuming the input element whose zone the stroke intersects is a signature 

5 field 880, the page server 10 appends the stroke to the signature field's digital ink. 

After a period of inactivity in the zone of the signature field, the page server 
sends the pen ID 61 and the pending strokes to the registration server 1 1 for verification. 
It also sends the application ID 64 associated with the form of which the signature field 
is part, as well as the form ID 56 and the current data content of the form. The 

10 registration server identifies the user corresponding to the pen, and uses the user's 
dynamic signature biometric 818 to verify the strokes as the user's signature. Once it has 
verified the signature, the registration server uses the application ID 64 and user ID 60 to 
identify the user's application-specific private signature key. It then uses the key to 
generate a digital signature of the form data, and returns the digital signature to the 

1 5 requesting page server. The page server assigns the digital signature to the signature field 
and sets the associated form's status to frozen. 

The digital signature includes the alias ID 65 of the corresponding user. This 
allows a single form to capture multiple users' signatures. 

5.5 Form Submission Protocol 

20 A preferred embodiment of a form submission protocol is shown in Figure 46. 

Form submission occurs via a form hyperlink activation. It thus follows the 
protocol defined in Section 5.2, with some form-specific additions. 

In the case of a form hyperlink, the hyperlink activation message sent by the 
page server 10 to the application 71 also contains the form ID 56 and the current data 
25 content of the form. If the form contains any signature fields, then the application 
verifies each one by extracting the alias ID 65 associated with the corresponding digital 
signature and obtaining the corresponding certificate from the registration server 1 1 . 

5.6 Commission Payment Protocol 

A preferred embodiment of a commission payment protocol is shown in Figure 
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47. 

In an e-commerce environment, fees and commissions may be payable from an 
application provider to a publisher on click-throughs, transactions and sales. 
Commissions on fees and commissions on commissions may also be payable from the 
5 publisher to the provider of the printer. 

The hyperlink request ID 52 is used to route a fee or commission credit from 
the target application provider 70a (e.g. merchant) to the source application provider 70b 
(i.e. publisher), and from the source application provider 70b to the printer provider 72. 

The target application receives the hyperlink request ID from the page server 10 
10 when the hyperlink is first activated, as described in Section 5.2. When the target 
application needs to credit the source application provider, it sends the application 
provider credit to the original page server together with the hyperlink request ID. The 
page server uses the hyperlink request ID to identify the source application, and sends 
the credit on to the relevant registration server 1 1 together with the source application ID 
1 5 64, its own server ID 53, and the hyperlink request ID. The registration server credits the 
corresponding application provider's account 827. It also notifies the application 
provider. 

If the application provider needs to credit the printer provider, it sends the 
printer provider credit to the original page server together with the hyperlink request ID. 
20 The page server uses the hyperlink request ID to identify the printer, and sends the credit 
on to the relevant registration server together with the printer ID. The registration server 
credits the corresponding printer provider account 814. 

The source application provider is optionally notified of the identity of the 
target application provider, and the printer provider of the identity of the source 
25 application provider. 

6. Netpage Pen Description 

6.1 Pen Mechanics 

Referring to Figures 8 and 9, the pen, generally designated by reference 

numeral 101, includes a housing 102 in the form of a plastics moulding having walls 103 
30 defining an interior space 104 for mounting the pen components. The pen top 105 is in 
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operation rotatably mounted at one end 106 of the housing 102. A semi-transparent cover 
107 is secured to the opposite end 108 of the housing 102. The cover 107 is also of 
moulded plastics, and is formed from semi-transparent material in order to enable the 
user to view the status of the LED mounted within the housing 102. The cover 107 
5 includes a main part 109 which substantially surrounds the end 108 of the housing 102 
and a projecting portion 1 10 which projects back from the main part 109 and fits within 
a corresponding slot 111 formed in the walls 103 of the housing 102. A radio antenna 

1 12 is mounted behind the projecting portion 1 10 5 within the housing 102. Screw threads 

113 surrounding an aperture 1 13A on the cover 107 are arranged to receive a metal end 
10 piece 114, including corresponding screw threads 115. The metal end piece 114 is 

removable to enable ink cartridge replacement. 

Also mounted within the cover 107 is a tri-color status LED 1 16 on a flex PCB 
117. The antenna 112 is also mounted on the flex PCB 117. The status LED 116 is 
mounted at the top of the pen 101 for good all-around visibility. 

1 5 The pen can operate both as a normal marking ink pen and as a non-marking 

stylus. An ink pen cartridge 118 with nib 119 and a stylus 120 with stylus nib 121 are 
mounted side by side within the housing 102. Either the ink cartridge nib 119 or the 
stylus nib 121 can be brought forward through open end 122 of the metal end piece 1 14, 
by rotation of the pen top 105. Respective slider blocks 123 and 124 are mounted to the 

20 ink cartridge 118 and stylus 120, respectively. A rotatable cam barrel 125 is secured to 
the pen top 105 in operation and arranged to rotate therewith. The cam barrel 125 
includes a cam 126 in the form of a slot within the walls 181 of the cam barrel. Cam 
followers 127 and 128 projecting from slider blocks 123 and 124 fit within the cam slot 
126. On rotation of the cam barrel 125, the slider blocks 123 or 124 move relative to 

25 each other to project either the pen nib 1 19 or stylus nib 121 out through the hole 122 in 
the metal end piece 114. The pen 101 has three states of operation. By turning the top 
105 through 90° steps, the three states are: 

Stylus 120 nib 121 out; 

• Ink cartridge 118 nib 119 out; and 

30 • Neither ink cartridge 1 1 8 nib 1 1 9 out nor stylus 1 20 nib 1 2 1 out. 
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A second flex PCB 129, is mounted on an electronics chassis 130 which sits 
within the housing 102. The second flex PCB 129 mounts an infrared LED 131 for 
providing infrared radiation for projection onto the surface. An image sensor 132 is 
provided mounted on the second flex PCB 129 for receiving reflected radiation from the 
5 surface. The second flex PCB 129 also mounts a radio frequency chip 133, which 
includes an RF transmitter and RF receiver, and a controller chip 134 for controlling 
operation of the pen 101. An optics block 135 (formed from moulded clear plastics) sits 
within the cover 107 and projects an infrared beam onto the surface and receives images 
onto the image sensor 132. Power supply wires 136 connect the components on the 

10 second flex PCB 129 to battery contacts 137 which are mounted within the cam barrel 
125. A terminal 138 connects to the battery contacts 137 and the cam barrel 125. A three 
volt rechargeable battery 139 sits within the cam barrel 125 in contact with the battery 
contacts. An induction charging coil 140 is mounted about the second flex PCB 129 to 
enable recharging of the battery 139 via induction. The second flex PCB 129 also 

15 mounts an infrared LED 143 and infrared photodiode 144 for detecting displacement in 
the cam barrel 125 when either the stylus 120 or the ink cartridge 1 18 is used for writing, 
in order to enable a determination of the force being applied to the surface by the pen nib 
119 or stylus nib 121. The IR photodiode 144 detects light from the IR LED 143 via 
reflectors (not shown) mounted on the slider blocks 123 and 124. 

20 Rubber grip pads 141 and 142 are provided towards the end 108 of the housing 

102 to assist gripping the pen 101, and top 105 also includes a clip 142 for clipping the 
pen 101 to a pocket. 

6.2 Pen Controller 

The pen 101 is arranged to determine the position of its nib (stylus nib 121 or 

25 ink cartridge nib 1 19) by imaging, in the infrared spectrum, an area of the surface in the 
vicinity of the nib. It records the location data from the nearest location tag, and is 
arranged to calculate the distance of the nib 121 or 119 from the location tab utilising 
optics 135 and controller chip 134. The controller chip 134 calculates the orientation of 
the pen and the nib-to-tag distance from the perspective distortion observed on the 

30 imaged tag. 

Utilising the RF chip 133 and antenna 1 12 the pen 101 can transmit the digital 
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ink data (which is encrypted for security and packaged for efficient transmission) to the 
computing system. 

When the pen is in range of a receiver, the digital ink data is transmitted as it is 
formed. When the pen 101 moves out of range, digital ink data is buffered within the pen 
5 101 (the pen 101 circuitry includes a buffer arranged to store digital ink data for 
approximately 12 minutes of the pen motion on the surface) and can be transmitted later. 

The controller chip 134 is mounted on the second flex PCB 129 in the pen 101. 
Figure 10 is a block diagram illustrating in more detail the architecture of the controller 
chip 134. Figure 10 also shows representations of the RF chip 133, the image sensor 132, 
10 the tri-color status LED 116, the IR illumination LED 131, the IR force sensor LED 143, 
and the force sensor photodiode 144. 

The pen controller chip 134 includes a controlling processor 145. Bus 146 
enables the exchange of data between components of the controller chip 134. Flash 
memory 147 and a 512 KB DRAM 148 are also included. An analog-to-digital converter 
15 149 is arranged to convert the analog signal from the force sensor photodiode 144 to a 
digital signal. 

An image sensor interface 152 interfaces with the image sensor 132. A 
transceiver controller 153 and base band circuit 154 are also included to interface with 
the RF chip 133 which includes an RF circuit 155 and RF resonators and inductors 156 
20 connected to the antenna 112. 

The controlling processor 145 captures and decodes location data from tags 
from the surface via the image sensor 132, monitors the force sensor photodiode 144, 
controls the LEDs 116, 131 and 143, and handles short-range radio communication via 
the radio transceiver 153. It is a medium-performance (~40MHz) general-purpose RISC 
25 processor. 

The processor 145, digital transceiver components (transceiver controller 153 
and baseband circuit 154), image sensor interface 152, flash memory 147 and 512KB 
DRAM 148 are integrated in a single controller ASIC. Analog RF components (RF 
circuit 155 and RF resonators and inductors 156) are provided in the separate RF chip. 

30 The image sensor is a 215x215 pixel CCD (such a sensor is produced by 
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Matsushita Electronic Corporation, and is described in a paper by Itakura, K T 
Nobusada, N Okusenya, R Nagayoshi, and M Ozaki, "A 1mm 50k-Pixel IT CCD Image 
Sensor for Miniature Camera System", IEEE Transactions on Electronic Devices, Volt 
47, number 1, January 2000, which is incorporated herein by reference) with an IR filter. 

5 The controller ASIC 134 enters a quiescent state after a period of inactivity 

when the pen 101 is not in contact with a surface. It incorporates a dedicated circuit 150 
which monitors the force sensor photodiode 144 and wakes up the controller 134 via the 
power manager 151 on a pen-down event. 

The radio transceiver communicates in the unlicensed 900MHz band normally 
10 used by cordless telephones, or alternatively in the unlicensed 2.4GHz industrial, 
scientific and medical (ISM) band, and uses frequency hopping and collision detection to 
provide interference-free communication. 

In an alternative embodiment, the pen incorporates an Infrared Data 
Association (IrDA) interface for short-range communication with a base station or 
1 5 netpage printer. 

In a further embodiment, the pen 101 includes a pair of orthogonal 
accelerometers mounted in the normal plane of the pen 101 axis. The accelerometers 190 
are shown in Figures 9 and 10 in ghost outline. 

The provision of the accelerometers enables this embodiment of the pen 101 to 
20 sense motion without reference to surface location tags, allowing the location tags to be 
sampled at a lower rate. Each location tag ID can then identify an object of interest rather 
than a position on the surface. For example, if the object is a user interface input element 
(e.g. a command button), then the tag ID of each location tag within the area of the input 
element can directly identify the input element. 

25 The acceleration measured by the accelerometers in each of the x and y 

directions is integrated with respect to time to produce an instantaneous velocity and 
position. 

Since the starting position of the stroke is not known, only relative positions 
within a stroke are calculated. Although position integration accumulates errors in the 
30 sensed acceleration, accelerometers typically have high resolution, and the time duration 
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of a stroke, over which errors accumulate, is short. 

7. Netpage Printer Description 

7.1 Printer Mechanics 

The vertically-mounted netpage wallprinter 601 is shown fully assembled in 

5 Figure 11. It prints netpages on Letter/A4 sized media using duplexed 854" Memjet™ 
print engines 602 and 603, as shown in Figures 12 and 12a. It uses a straight paper path 
with the paper 604 passing through the duplexed print engines 602 and 603 which print 
both sides of a sheet simultaneously, in full color and with full bleed. 

An integral binding assembly 605 applies a strip of glue along one edge of each 
1 0 printed sheet, allowing it to adhere to the previous sheet when pressed against it. This 
creates a final bound document 618 which can range in thickness from one sheet to 
several hundred sheets. 

The replaceable ink cartridge 627, shown in Figure 13 coupled with the 
duplexed print engines, has bladders or chambers for storing fixative, adhesive, and 

1 5 cyan, magenta, yellow, black and infrared inks. The cartridge also contains a micro air 
filter in a base molding. The micro air filter interfaces with an air pump 638 inside the 
printer via a hose 639. This provides filtered air to the printheads to prevent ingress of 
micro particles into the Memjet™ printheads 350 which might otherwise clog the 
printhead nozzles. By incorporating the air filter within the cartridge, the operational life 

20 of the filter is effectively linked to the life of the cartridge. The ink cartridge is a fully 
recyclable product with a capacity for printing and gluing 3000 pages (1500 sheets). 

Referring to Figure 12, the motorized media pick-up roller assembly 626 
pushes the top sheet directly from the media tray past a paper sensor on the first print 
engine 602 into the duplexed Memjet™ printhead assembly. The two Memjet™ print 
25 engines 602 and 603 are mounted in an opposing in-line sequential configuration along 
the straight paper path. The paper 604 is drawn into the first print engine 602 by integral, 
powered pick-up rollers 626. The position and size of the paper 604 is sensed and full 
bleed printing commences. Fixative is printed simultaneously to aid drying in the 
shortest possible time. 

30 The paper exits the first Memjet™ print engine 602 through a set of powered 
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exit spike wheels (aligned along the straight paper path), which act against a rubberized 
roller. These spike wheels contact the 'wet' printed surface and continue to feed the 
sheet 604 into the second Memjet™ print engine 603. 

Referring to Figures 12 and 12a, the paper 604 passes from the duplexed print 
5 engines 602 and 603 into the binder assembly 605. The printed page passes between a 
powered spike wheel axle 670 with a fibrous support roller and another movable axle 
with spike wheels and a momentary action glue wheel. The movable axle/glue assembly 
673 is mounted to a metal support bracket and it is transported forward to interface with 
the powered axle 670 via gears by action of a camshaft. A separate motor powers this 
1 0 camshaft. 

The glue wheel assembly 673 consists of a partially hollow axle 679 with a 
rotating coupling for the glue supply hose 641 from the ink cartridge 627. This axle 679 
connects to a glue wheel, which absorbs adhesive by capillary action through radial 
holes. A molded housing 682 surrounds the glue wheel, with an opening at the front. 
15 Pivoting side moldings and sprung outer doors are attached to the metal bracket and 
hinge out sideways when the rest of the assembly 673 is thrust forward. This action 
exposes the glue wheel through the front of the molded housing 682. Tension springs 
close the assembly and effectively cap the glue wheel during periods of inactivity. 

As the sheet 604 passes into the glue wheel assembly 673, adhesive is applied 
20 to one vertical edge on the front side (apart from the first sheet of a document) as it is 
transported down into the binding assembly 605. 

7.2 Printer Controller Architecture 

The netpage printer controller consists of a controlling processor 750, a factory- 
installed or field-installed network interface module 625, a radio transceiver (transceiver 
25 controller 753, baseband circuit 754, RF circuit 755, and RF resonators and inductors 
756), dual raster image processor (RIP) DSPs 757, duplexed print engine controllers 
760a and 760b, flash memory 658, and 64MB of DRAM 657, as illustrated in Figure 14. 

The controlling processor handles communication with the network 19 and 
with local wireless netpage pens 101, senses the help button 617, controls the user 
30 interface LEDs 613-616, and feeds and synchronizes the RIP DSPs 757 and print engine 
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controllers 760. It consists of a medium-performance general-purpose microprocessor. 
The controlling processor 750 communicates with the print engine controllers 760 via a 
high-speed serial bus 659. 

The RIP DSPs rasterize and compress page descriptions to the netpage printer's 
5 compressed page format. Each print engine controller expands, dithers and prints page 
images to its associated Memjet™ printhead 350 in real time (i.e. at over 30 pages per 
minute). The duplexed print engine controllers print both sides of a sheet simultaneously. 

The master print engine controller 760a controls the paper transport and 
monitors ink usage in conjunction with the master QA chip 665 and the ink cartridge QA 
10 chip 761. 

The printer controller's flash memory 658 holds the software for both the 
processor 750 and the DSPs 757, as well as configuration data. This is copied to main 
memory 657 at boot time. 

The processor 750, DSPs 757, and digital transceiver components (transceiver 
15 controller 753 and baseband circuit 754) are integrated in a single controller ASIC 656. 
Analog RF components (RF circuit 755 and RF resonators and inductors 756) are 
provided in a separate RF chip 762. The network interface module 625 is separate, since 
netpage printers allow the network connection to be factory-selected or field-selected. 
Flash memory 658 and the 2x256Mbit (64MB) DRAM 657 is also off-chip. The print 
20 engine controllers 760 are provided in separate ASICs. 

A variety of network interface modules 625 are provided, each providing a 
netpage network interface 751 and optionally a local computer or network interface 752. 
Netpage network Internet interfaces include POTS modems, Hybrid Fiber-Coax (HFC) 
cable modems, ISDN modems, DSL modems, satellite transceivers, current and next- 
25 generation cellular telephone transceivers, and wireless local loop (WLL) transceivers. 
Local interfaces include IEEE 1284 (parallel port), lOBase-T and 100Base-T Ethernet, 
USB and USB 2.0, IEEE 1394 (Firewire), and various emerging home networking 
interfaces. If an Internet connection is available on the local network, then the local 
network interface can be used as the netpage network interface. 

30 The radio transceiver 753 communicates in the unlicensed 900MHz band 
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normally used by cordless telephones, or alternatively in the unlicensed 2.4GHz 
industrial, scientific and medical (ISM) band, and uses frequency hopping and collision 
detection to provide interference-free communication. 

The printer controller optionally incorporates an Infrared Data Association 
5 (IrDA) interface for receiving data "squirted" from devices such as netpage cameras. In 
an alternative embodiment, the printer uses the IrDA interface for short-range 
communication with suitably configured netpage pens. 

7.2.1 Rasterization and Printing 

Once the main processor 750 has received and verified the document's page 

1 0 layouts and page objects, it runs the appropriate RIP software on the DSPs 757. 

The DSPs 757 rasterize each page description and compress the rasterized page 
image. The main processor stores each compressed page image in memory. The simplest 
way to load-balance multiple DSPs is to let each DSP rasterize a separate page. The 
DSPs can always be kept busy since an arbitrary number of rasterized pages can, in 
1 5 general, be stored in memory. This strategy only leads to potentially poor DSP utilization 
when rasterizing short documents. 

Watermark regions in the page description are rasterized to a contone- 
resolution bi-level bitmap which is losslessly compressed to negligible size and which 
forms part of the compressed page image. 

20 The infrared (IR) layer of the printed page contains coded netpage tags at a 

density of about six per inch. Each tag encodes the page ID, tag ID, and control bits, and 
the data content of each tag is generated during rasterization and stored in the 
compressed page image. 

The main processor 750 passes back-to-back page images to the duplexed print 
25 engine controllers 760. Each print engine controller 760 stores the compressed page 
image in its local memory, and starts the page expansion and printing pipeline. Page 
expansion and printing is pipelined because it is impractical to store an entire 1 14MB bi- 
level CMYK+IR page image in memory. 
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7.2.2 Print Engine Controller 

The page expansion and printing pipeline of the print engine controller 760 

consists of a high speed IEEE 1394 serial interface 659, a standard JPEG decoder 763, a 
standard Group 4 Fax decoder 764, a custom halftoner/compositor unit 765, a custom tag 
5 encoder 766, a line loader/formatter unit 767, and a custom interface 768 to the 
Memjet™ printhead 350. 

The print engine controller 360 operates in a double buffered manner. While 
one page is loaded into DRAM 769 via the high speed serial interface 659, the 
previously loaded page is read from DRAM 769 and passed through the print engine 
10 controller pipeline. Once the page has finished printing, the page just loaded is printed 
while another page is loaded. 

The first stage of the pipeline expands (at 763) the JPEG-compressed contone 
CMYK layer, expands (at 764) the Group 4 Fax-compressed bi-level black layer, and 
renders (at 766) the bi-level netpage tag layer according to the tag format defined in 

15 section 1.2, all in parallel. The second stage dithers (at 765) the contone CMYK layer 
and composites (at 765) the bi-level black layer over the resulting bi-level CMYK layer. 
The resultant bi-level CMYK+IR dot data is buffered and formatted (at 767) for printing 
on the Memjet™ printhead 350 via a set of line buffers. Most of these line buffers are 
stored in the off-chip DRAM. The final stage prints the six channels of bi-level dot data 

20 (including fixative) to the Memjet™ printhead 350 via the printhead interface 768. 

When several print engine controllers 760 are used in unison, such as in a 
duplexed configuration, they are synchronized via a shared line sync signal 770. Only 
one print engine 760, selected via the external master/slave pin 771, generates the line 
sync signal 770 onto the shared line. 

25 The print engine controller 760 contains a low-speed processor 772 for 

synchronizing the page expansion and rendering pipeline, configuring the printhead 350 
via a low-speed serial bus 773, and controlling the stepper motors 675, 676. 

In the 814" versions of the netpage printer, the two print engines each prints 30 
Letter pages per minute along the long dimension of the page (1 1"), giving a line rate of 
30 8.8 kHz at 1600 dpi. In the 12" versions of the netpage printer, the two print engines 
each prints 45 Letter pages per minute along the short dimension of the page (814"), 
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giving a line rate of 10.2 kHz. These line rates are well within the operating frequency of 
the Memjet™ printhead, which in the current design exceeds 30 kHz. 

8 Netpage Tags 
8.1 Tag Tiling 

5 8.1 .1 Planar Surface Tag Tiling 

In order to support "single-click" interaction with a tagged region via a sensing 

device, the sensing device must be able to see at least one entire tag 4 in its field of view 
no matter where in the region or at what orientation it is positioned. The required 
diameter of the field of view of the sensing device is therefore a function of the size and 
1 0 spacing of the tags 4. 

In the case where the tag shape is circular, such as the preferred tag 4 described 
earlier, the minimum diameter m of the sensor field of view is obtained when the tags 
500, of diameter k y are tiled on an equilateral triangular grid, as shown in Figure 52 and 
defined in EQ 1. This is achieved when the center-to-center tag spacing is the same as 
1 5 the tag diameter k. 

With a tag diameter k of 256 dots (-4 mm at 1600 dpi), m is therefore 552 dots 
(-8.8 mm). With a quiet area of 16 dots, i.e. an effective tag diameter k of 272 dots (-4.3 
mm), m increases to 587 dots (-9.3 mm). 

When the tags 4 are moved a distance s apart, where s is at least as large as k 9 
20 then the minimum field of view is given by EQ 2. 

When no overlap is desired in the horizontal direction between successive lines 
of tags 500, for example to make tag rendering easier, the tags must be moved apart by a 
minimum amount given by EQ 3. For a 256-dot diameter tag, u is therefore 40 dots 
(-0.6 mm at 1600 dpi). Since this exceeds the quiet area required for the tag, the quiet 
25 area can be ignored if tag lines are rendered to not overlap. 

Setting s = k + u in EQ 2 gives EQ 4. For a 256-dot diameter tag, s is therefore 
296 dots (-4.7 mm at 1600 dpi), and m is 598 dots (-9.5 mm). 

8.1.2 Spherical Surface Tag Tiling 

A regular icosahedron is often used as the basis for generating an almost 
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regular triangular tiling of a sphere. A regular icosahedron, such as icosahedron 526 in 
Figure 53, is composed of twenty equal-sized equilateral triangular faces 528 sharing 
thirty edges 530 and twelve vertices 532, with five of the edges 530 meeting at each of 
the vertices 532. 

5 To achieve the required tiling, the icosahedron 526 is inscribed in a target 

sphere, and each triangle 528 of the icosahedron 526 is subdivided into an equal number 
of equal-sized equilateral subdivision triangles to yield the desired total number of 
triangles. If each edge 530 of the icosahedron is divided into v equal intervals, defining a 
set of v-1 points along each edge, and each pair of corresponding points along any two 

10 adjacent edges is joined by a line parallel to the other shared adjacent edge, the lines so 
drawn intersect at the vertices of the desired equal-sized and equilateral subdivision 
triangles, resulting in the creation of v 2 triangles per triangular face 528 of the 
icosahedron 526, or 20v 2 triangles in all. Of the resulting 10v 2 + 2 vertices, five 
triangular faces meet at each of the twelve original vertices of the icosahedron 526, and 

15 six triangular faces meet at the each of the remaining vertices. The twelve original 
vertices 532 already lie on the sphere, while the remaining vertices lie inside the sphere. 
Each created vertex is therefore centrally projected onto the sphere, giving the desired 
tiling. 

A sphere approximated by a regular polyhedron in this way is referred to as a 
20 geodesic, and the parameter v is referred to as the frequency of the geodesic. Figure 54 
shows an icosahedral geodesic 534 with v=3, i.e. with 180 faces 528. 

The closer a subdivision triangle is to the center of a face of the icosahedron 
526, the further it is from the surface of the sphere, and hence the larger it is when 
projected onto the sphere. To minimise variation in the size of projected subdivision 

25 triangles, subdivison vertices can systematically be displaced prior to projection 
(Tegmark, M., "An Icosahedron-Based Method for Pixelizing the Celestial Sphere", ApJ 
Letters, 470, L81, October 14, 1996). If v = 1 then no vertices are created and the angle 
subtended by a triangular face at a vertex remains 60°. As v increases, however, the 
surface defined by the five triangular faces surrounding each original vertex becomes 

30 increasingly flat, and the vertex angle of each triangular face converges on 72° (i.e. 360° 
/ 5). This defines the worst case for a tag tiling of a sphere. In a 72° isosceles triangle the 
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base length is 1.18 times the length of the two sides. The maximum tag spacing s for the 
purposes of calculating the sensor field of view is therefore close to 1.18A:. With a tag 
diameter of 256 dots and a quiet area of 16 dots, i.e. an effective tag diameter k of 272 
dots (-4.3 mm), m is therefore 643 dots (-10.2 mm) according to EQ 2. 

5 The angle subtended by each edge of an icosahedron at the center of the 

circumscribing sphere is given by EQ 5 

For a sphere of radius r the arc length of each centrally projected edge is r9. 
Given a tag diameter of K in the same units as r, the number of tags n required to cover 
the sphere is given by EQ 6. 

1 0 For a given w, r is limited by EQ 7. 

If n is limited to 2 16 , to allow the use of a 16-bit tag ID without requiring 
multiple regions to cover the sphere, and K is taken to be 4.3 mm as above, then r is 
limited to -310 mm. 

A typical globe has a radius of 160 mm. Its projected arc length of -177 mm 
15 fits 41 evenly spaced tags with negligible additional spacing. Such a globe uses 16812 
tags in total. 

8.1.3 Arbitrary Curved Surface Tag Tiling 

A triangle mesh can approximate a surface of arbitrary topography and 

topology without introducing discontinuities or singularities, with the local scale of the 
20 mesh being dictated by the local curvature of the surface and an error bound. Assuming 
the existence of a triangle mesh for a particular surface, an effective non-regular tiling of 
tags can be produced as long as each mesh triangle respects a minimum vertex angle and 
a minimum edge length. A tiling is considered effective with respect to a particular 
sensing device if the field of view of the sensing device is guaranteed to include at least 
25 one complete tag at any position of the sensing device on the surface. 

The tiling procedure starts by placing a tag at each vertex of the mesh, so the 
minimum edge length is the same as the tag diameter k. The tiling procedure proceeds by 
inserting a tag at the midpoint of any edge whose length exceeds a maximum tag 
separation s. As illustrated in Figure 9, the maximum tag spacing s is calculated so that if 
30 two adjacent tags 4a and 4b are a distance s + e apart, then there is room for another tag 



NPT001US 




-75- 



4c between them, i.e. EQ 8. 

However, if the vertex angle between two edges of length s + 8 is less than 60°, 
then the inserted tags will overlap. 

To prevent inserted tags from overlapping, a minimum tag separation t is 
5 introduced, where t > k. The minimum vertex angle a then becomes a function of k and t, 
as shown in EQ 9. 

Clearly, when / = k, p is constrained to be 60°, i.e. the mesh is constrained to be 
equilateral. But as illustrated in Figure 56, when t > k 9 p can be less than 60° without 
inserted tags overlapping. 

10 The maximum tag separation s must be based on the new minimum tag 

separation r, in accordance with EQ 10. 

When considering a particular mesh triangle, there are four distinct tag 
insertion scenarios. By assuming that the minimum vertex angle is no less than 30° (i.e. 
half of 60°), it can be shown that whenever a mesh triangle has at least one edge less 
15 than or equal to s in length, the remaining two edges are less than 2s in length. In 
practice the minimum vertex angle is typically at least 45°. 

In the first scenario (Figure 57) no edges of a triangle 546 exceed s in length, so 
the tagging of the triangle is already complete. 

In the second scenario (Figure 58) one edge 548 of a triangle 550 exceeds s in 
20 length. A tag 552 is inserted at the midpoint of the edge 548 to complete the tagging of 
the triangle 550. 

In the third scenario (Figure 59) two edges 554, 556 of a triangle 558 exceed s 
in length. Tags 560, 562 are inserted at the midpoint of each of the two long edges 554, 
556 and this may complete the tagging of the triangle 558. Centers of the two inserted 
25 tags 560, 562 together with the two vertices 564, 566 of the short edge 568 of the 
original triangle 558 form a trapezoid. If either diagonal of the trapezoid exceeds s in 
length then a final tag 570 is inserted at the center of the trapezoid to complete the 
tagging of the triangle. 

In the fourth scenario (Figure 60) all three edges 572 of a triangle 573 exceed s 
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in length. A tagged vertex 574 is inserted at the midpoint of each edge 572 and the three 
new vertices 574 are joined by edges 576. The tagging procedure is then recursively 
applied to each of the four resultant triangles 577, 578, 579 and 580. Note that the new 
triangles respect the minimum vertex angle because they have the same shape as the 
5 original triangle 573. 

The tag tiling variables are summarized in Table 4. 

Table 4. Tag tiling variables 



variable 


Meaning 


P 


minimum vertex angle 


k 


tag diameter 


m 


minimum diameter of sensor field of view on surface 


s 


maximum center-to-center tag spacing 


t 


minimum center-to-center tag spacing 



10 8.2 Tag Sensing 

8.2.1 Pen Orientation 

To allow a pen-like sensing device to be used as a comfortable writing 

instrument, a range of pen orientations must be supported. Since the pen nib is 
constrained to be in contact with the surface, the orientation of the pen can be 
1 5 characterized by the yaw (z rotation), pitch (x rotation) and roll (y rotation) of the pen, as 
illustrated in Figure 61. While the yaw of the pen must be unconstrained, it is reasonable 
to constrain the pitch and roll of the pen as well as the overall tilt of the pen resulting 
from the combination of pitch and roll. 

Yaw is conventionally applied after pitch, such that, for example, in the case of 
20 a pen device it would define a twist about the physical axis rather than a direction in the 
surface plane. In a pen with a marking nib, however, the image sensor is mounted off the 
axis of the pen and the pen's image sensing ability (and hence its yaw sensing ability) is 
therefore constrained unless the pen is held almost vertically, as discussed below. Yaw is 
therefore applied before pitch, allowing the full yaw range to be specified by rotating the 
25 pen relative to the surface while keeping pitch and roll constant. 
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Pitch and roll are conventionally defined as y and x rotations, respectively. 
Here they are defined as x and y rotations, respectively, because they are defined with 
respect to the x-y coordinate system of the surface, where the y axis is the natural 
longitudinal axis and the x axis is the natural lateral axis when viewed by a user. In a 
5 right-handed 3D coordinate system, roll is conventionally defined as positive when 
anticlockwise, while pitch and yaw are conventionally defined as positive when 
clockwise. Here all rotations are defined as positive when anticlockwise. 

The pen's overall tilt (9) is related to its pitch (<|>) and the roll (v|/) in accordance 
with EQ 11. 

1 0 The pen's tilt affects the scale at which surface features are imaged at different 

points in the field of view, and therefore affects the resolution of the image sensor. Since 
it is impractical to sense the area directly under the pen nib, the pen's tilt also affects the 
distance from the nib to the center of the imaged area. This distance must be known to 
allow a precise nib position to be derived from the position determined from the tag. 

15 8.2.2 Image Sensing 

The field of view can be modeled as a cone defined by a solid half-angle a 

(giving an angular field of view of 2a), and an apex height of D above the surface when 
the optical axis is vertical. Although the image sensor is typically rectangular, only the 
largest elliptical subarea of the image sensor is relevant to guaranteeing that a 
20 sufficiently large part of the surface is imaged, as quantified earlier. 

The intersection of the field of view cone with the surface defines an elliptical 
window on the surface. This window is circular when the optical axis is vertical. 

Figure 62 illustrates the geometric relationship, for a given pitch-related tilt 0 
of the pen's optical axis, between the pen's nib (point A), the pen's optical axis (CE), 
25 and the field of view window (FH). The tilt is defined to be clockwise positive from the 
vertical. The equations which follow apply to both positive and negative tilt. 

When the pen is not tilted, the window diameter (i.e. | BD | ) is given by EQ 

12. 

If, when the pen is not tilted, the distance from the nib to edge of the window 
30 (i.e. I AB | ) is 7, then the distance S from the nib to the center of the window (i.e. 
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| AC I ) is given by EQ 13. 

When the pen is tilted by 9, the distance from the viewpoint to the surface 
along the optical axis is reduced to d (i.e. I GE | ), given by EQ 14. 

The width of the window (i.e. | FH | ) is then given by EQ 15. 

5 D and a must be chosen so that an adequately large area is imaged throughout 

the supported tilt range. The required minimum diameter m of the area is given by EQ 4, 
while the width of the actual imaged area is given by EQ 15. This then gives EQ 16. 

Once D and a are determined, an image sensor resolution must be chosen so 
that the imaged area is adequately sampled, i.e. that the maximum feature frequency is 
1 0 sampled at its Nyquist rate or higher. 

When imaged, the scale of the surface decreases with increasing distance from 
the viewpoint and with increasing inclination relative to the viewing ray. Both factors 
have maximum effect at point F for positive tilt and point H for negative tilt, i.e. at the 
point in the window furthest from the viewpoint. Note that references to F in the 
1 5 following discussion apply to H when the tilt is negative. 

The distance of point F from the viewpoint (i.e. | EF | ) is given by EQ 17. 

Scaling due to the inclination of the surface relative to the viewing ray through 
F (EF) is given by EQ 18. 

If the surface feature frequency is f 9 then the angular surface feature frequency 
20 cd at F (i.e. with respect to the field of view) due to both factors is given by EQ 19. 

When there is no object plane tilt (i.e. 9 = 0), this reduces to EQ 20. 

The image sensor is, by definition, required to image at least the entire angular 
field of view. Since the pixel density of the image sensor is uniform, it must image the 
entire field of view at maximum frequency. Given an angular field of view in image 
25 space of 2a', an image sensor tilt (i.e. image plane tilt) with respect to the optical axis of 
0', and a sampling rate of n (where n > 2 according to Nyquist' s theorem), the minimum 
image sensor resolution q is given by EQ 21 and EQ 22. 

The cos-squared term in the numerator in EQ 22 results from the same 
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reasoning as the cos-squared term in the denominator in EQ 19. 

When there is no image plane tilt (i.e. 9' = 0), and the image space and object 
space angular fields of view are equal (i.e. a' = a), this reduces to EQ 23 and EQ 24. 

When there is no object plane tilt (i.e. 0 = 0) this reduces further to EQ 25. 

5 When the image plane tilt and the object plane tilt are equal (i.e. 0' = 0), and 

the image space and object space angular fields of view are equal (i.e. a' = a), EQ 22 
reduces to EQ 26. 

Matching the image plane tilt to the object plane tilt therefore yields a smaller 
required image sensor size than when the image sensor tilt is fixed at zero, and 
1 0 eliminates perspective distortion from the captured image. Variable image sensor tilt is, 
however, a relatively cotly option in practice, and also requires greater depth of field. 

Figure 63 illustrates the geometric relationship, for a given roll-related tilt 0 of 
the pen's optical axis, between the pen's nib (point A), the pen's optical axis (CE), and 
the field of view window (FH). The tilt is again defined to be clockwise positive from 
1 5 the vertical. With the exception of EQ 13, the preceding equations apply equally to roll- 
induced tilt. For roll-induced tilt the distance S from the nib to the center of the window 
(i.e. | AC | ) is zero rather than as defined by EQ 13. 

For pitch-induced tilt, the magnitude of the tilt range is maximised by choosing 
a minimum (negative) tilt and a maximum (positive) tilt which have the same image 
20 sensor requirement. Since, for pitch-induced tilt, the surface is more distant for negative 
tilt than for positive tilt of the same magnitude, the minimum has a smaller magnitude 
than the maximum. For roll-induced tilt they have the same magnitude. 

As described above, the smallest features of the tag 4 are the structures which 
encode the data bits, and these have a minimum diameter of 8 dots. This gives a 
25 maximum feature frequency / of about 7.9 per mm at 1600 dpi. 

As calculated according to EQ 4 above, an equilateral triangular tiling of 256- 
dot diameter tags with no overlap between successive lines of tags requires a minimum 
field of view window diameter on the surface of 598 dots, or about 9.5 mm at 1600 dpi. 

Most people hold a pen at about +30° pitch and 0° roll. The inking ball of a 
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ball-point nib loses effective contact with the surface beyond about +50° pitch (i.e. 40° 
from the horizontal). A reasonable target pitch range is therefore -10° to +50°, and a 
reasonable roll range -30° to +30°, bearing in mind greater limitations on combined 
pitch and roll as given by EQ 11. 

5 The highly compact (1.5 mm 2 ) Matsushita CCD image sensor (Matsushita 

Electronic Corporation, and is described in a paper by Itakura, K T Nobusada, N 
Okusenya, R Nagayoshi, and M Ozaki, "A 1mm 50k-Pixel IT CCD Image Sensor for 
Miniature Camera System", IEEE Transactions on Electronic Devices, Volt 47, number 
1, January 2000) is suitable for use in a compact device such as a pen. It has an available 

10 resolution of 215x215 pixels. Assuming equal image and object space angular fields of 
view, no image plane tilt, and a nib-to-window distance T of 4 mm, optimizing the 
geometry using EQ 16 and EQ 24 to achieve the desired pitch and roll ranges stated 
above yields a pitch range of -16° to +48° (64°) and a roll range of -28° to +28° (56°) 
with a viewing distance D of 30 mm and an angular field of view of 18.8° (a = 9.4°). 

15 The available pitch range is actually -21° to +43°, and this is mapped to close to the 
desired range by pitching the optical axis at -5° relative to the physical axis. Note that 
the tilt range can be expanded slightly by optimizing a non-zero tilt of the image plane. 

The overall pen tilt is thus confined to an elliptical cone whose major angle is 
64° in the pitch plane and whose minor angle is 56° in the roll plane. 

20 The image sensing variables are summarized in Table 5. 



Table 5. Image sensing variables 



variable 


meaning 


a 


Object space field of view half-angle 


a 


Image space field of view half-angle 


Y 


Pen yaw 


0 


Object plane tilt (i.e. overall pen tilt) 


e' 


Image plane tilt 


♦ 


Pen pitch 




Pen roll 


CO 


Angular frequency in field of view 


D 


Normal viewing distance 
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d 


Tilted viewing distance 


f 


Surface feature frequency 


n 


Sampling rate 


q 


Image sensor resolution 


s 


Distance from nib to center of field of view on 
surface (when 0 = 0) 


T 


Distance from nib to edge of field of view on 
surface (when 0 = 0) 



8.3 Tag Decoding 

8.3.1 Tag Image Processing and Decoding 

Tag image processing is described earlier in Section 1.2.4. It culminates in 

5 knowledge of the 2D perspective transform on the tag, as well as the decoded tag data. 

8.3.2 Inferring the Pen Transform 

Once the 2D perspective transform is obtained which accounts for the 

perspective distortion of the tag in the captured image, as described earlier, the 
corresponding discrete 3D tag transform with respect to the pen's optical axis can be 
1 0 inferred, as described below in Section 8.4. 

Once the discrete 3D tag transform is known, the corresponding 3D pen 
transform can be inferred, i.e. the transform of the pen's physical axis with respect to the 
surface. The pen's physical axis is the axis which is embodied in the pen's shape and 
which is experienced by the pen's user. It passes through the nib. The relationship 
1 5 between the physical axis and the optical axis is illustrated in Figure 64. 

It is convenient to define three coordinate spaces. In sensor space the optical 
axis coincides with the z axis and the the viewpoint is at the origin. In pen space the 
physical axis coincides with the z axis and the nib is at the origin. In tag space the tag 4 
lies in the x-y plane with its center at the origin. The tag transform transforms the tag 4 
20 from tag space to sensor space. 

Sensor space is illustrated in Figure 64. The labelling of points in Figure 64 is 
consistent with the labelling in Figure 62. The viewpoint is at E, the sensed point is at G, 



NPT001US 




-82- 

and the nib is at A. The intersection point G between the optical axis and the surface is 
referred to as the sensed point. In contrast with the geometry illustrated in Figure 62 
where the nib is considered as a point, here the nib is considered as a small sphere. If the 
nib is curved, then the tilt of the physical axis affects the offset between the sensed point 
5 and the contact point between the nib and the surface. The center point K of the spherical 
nib, about which the physical axis pivots, is referred to as the pivot point. 

The nib makes nominal contact with the surface at point A when the optical 
axis is vertical. KA is defined to be parallel to the optical axis. When the pen is tilted, 
however, contact is at point L, as shown in Figure 65. Given the radius R of the nib, the 
1 0 distance of the pivot point K from the surface, e.g. at A or L, is always R. 

The discrete tag transform includes the translation of the tag center from the 
sensed point, the 3D tag rotation, and the translation of the sensed point from the 
viewpoint. 

Given the translation d of the sensed point from the viewpoint in the discrete 
1 5 tag transform, and according to EQ 14, the sensed point is given by EQ 27. 

Since the physical axis only differs from the optical axis by a y translation and 
x rotation (i.e. pitch), the physical axis lies in the y-z plane. With reference to Figure 64, 
where I AC I = S and | EC | =D (just as in Figure 62), it is clear that in sensor space the 
position of the pivot point is given by EQ 28. 

20 The vector from the sensed point to the pivot point is therefore given by EQ 

29. 

The vector from the pivot point to the contact point is by definition a surface 
normal of length R. It is constructed by applying the 3D tag rotation M to a tag space 
surface normal, normalizing the result, and scaling by R, as shown in EQ 30 and EQ 31. 

25 The vector from the sensed point to the contact point is then obtained in 

accordance with EQ 32. 

This is transformed into tag space by applying the inverse of the tag transform 
3D rotation, and is then added to the vector from the tag center to the sensed point, to 
yield the vector from the tag center to the contact point in tag space, i.e. on the surface, 
30 in accordance with EQ 33. 
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This is finally added to the tag's absolute location, as implied by its tag ID, to 
yield the nib's desired absolute location in the tagged region: see EQ 34. 

The final step is to infer the pen's 3D orientation from the tag's 3D orientation. 
The pen's discrete rotations are simply the inverses of the tag's discrete rotations, with 
5 the pen's pitch also including the effect of the pitch of the optical axis with 

respect to the pen's axis, as defined in EQ 35, EQ 36 and EQ 37. 

8.4 Inferring the Tag Transform 

The image of the tag 4 captured by the image sensor contains perspective 

distortion due to the position and orientation of the image sensor with respect to the tag. 

10 Once the perspective targets of the tag are found in image space, an eight-degree-of- 
freedom perspective transform is inferred based on solving the well-understood 
equations relating the four tag-space and image-space point pairs. The discrete transform 
steps which give rise to the image of the tag are concatenated symbolically, and a set of 
simultaneous non-linear equations is obtained by equating corresponding terms in the 

1 5 concatenated transform and the perspective transform. Solving these equations yields the 
discrete transform steps, which include the desired tag offset from the nib, 3D tag 
rotation, and viewpoint offset from the surface. 

8.4.1 Modeling the Tag Transform 

The transform of the tag 4 from tag space to image space can be modeled as a 

20 concatenation of the following transform steps: 



x-y translate (by tag-to-viewpoint offset) 



z rotate (by tag yaw) 



x rotate (by tag pitch) 



y rotate (by tag roll) 



25 



z translate (by tag-to-viewpoint offset) 



perspective project (with specified focal length) 



x-y scale (to viewport size) 



These are concatenated symbolically to produce a single transform matrix 
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which effects the tag transform. Table 7 summarizes the discrete transform variables 
used in the following sections, together with the range of each variable. 

Table 7. Discrete transform variables and their ranges 



5 



Variable 


Abbrev. 


Meaning 


Unit 
transform 


Range 


Y 


- 


yaw 


0 


0 < y < 2n 


4> 


- 


pitch 


0 


-Till < (j)< 7l/2 


N> 


- 


roll 


0 


-7i/4 < vy < nIA 


fx 


A 


tag-to-viewpoint x offset 


0 




ty 


B 


tag-to-viewpoint y offset 


0 




V/vO j 


c 


cosine of yaw 


1 


-1 <C<1 


siny 


D 


sine of yaw 


0 


-1 <D< 1 


COS(|> 


E 


cosine of pitch 


1 


0 <E< 1 


sin<(> 


F 


sine of pitch 


0 


-1 <F< 1 


COS\|/ 


G 


cosine of roll 


1 


0<G<1 


sinvj; 


H 


sine of roll 


0 


-1 <H< 1 


tz 


1 


tag-to-viewpoint z offset 




/<0 


MX 


J 


inverse focal length 




J>0 


S 




viewport scale 




S>0 



Translate in x-y plane by t x and t y according to EQ 42 (where A=t x and B=t y ). 
Rotate about z by y according to EQ 43 (where C=cos(y) and D=sin(y)), giving 

EQ 44. 

1 0 Rotate about x by <|> according to EQ 45 (where £=cos(<t>) and F=sin((|))), giving 

EQ 46. 

Rotate about y by y according to EQ 47 (where G=cos(\|/) and //=sin(vj/)) ? 
giving EQ 48, where K and L are defined by EQ 49 and EQ 50. 

Translate in z by t 2 according to EQ 51 (where M 2 ), giving EQ 52. 

1 5 Perspective project with focal length X and projection plane at z=0 according to 

EQ 53 (where J=l A), giving EQ 54. 
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Scale to viewport by S according to EQ 55, giving EQ 56. 

Transform a point in the x-y plane (z=0) according to EQ 57, giving EQ 58. 

Finally, expand K and L, giving EQ 59. 

8.4.2 2D Perspective Transform 

5 Given an inferred eight-degree-of-freedom 2D perspective transform matrix as 

defined in EQ 60, multiply by an unknown /' to obtain the general nine-degree-of- 
freedom form of the matrix, as shown in EQ 61. 

Transform a 2D point according to EQ 62, giving EQ 63. 

8.4.3 Inferring the Tag Transform 

10 8.4.3.1 Equating Coefficients 

Equating the coefficients in EQ 59 with the coefficients in EQ 63 results in 

EQ 64 to EQ 72, being nine non-linear equations in 1 1 unknowns. 

These equations are augmented as required by the trigonometric identity 
relating the sine and cosine of an angle (i.e. the sine and cosine of any one of yaw, pitch 
1 5 and roll), as shown in EQ 73. 

Given the sine and cosine of an angle, the corresponding angle is obtained 
using a two-argument arctan as shown in EQ 74. 

8.4.3.2 Solving for X-Y Offset 

20 EQ 66 can be simplified using EQ 64 and EQ 65 to give EQ 75 and then EQ 

76. 

EQ 69 can be simplified using EQ 67 and EQ 68 to give EQ 77 and then EQ 

78. 

EQ 72 can be simplified using EQ 70 and EQ 71 to give EQ 79 and then EQ 

25 80. 

EQ 76 can be re-written as EQ 81, and EQ 78 can be re- written as EQ 82. 
Equating EQ 81 and EQ 82 and solving for 8 yields EQ 83 through EQ 85 and 
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finally EQ 86, which defines B. 

Substituting the value for B into EQ 82 and simplifying yields EQ 87 through EQ 
90 and finally EQ 91, which defines A 

This therefore gives the x-y offset of the tag 4 from the viewpoint, since A=t x 

5 and B=t y . 

8.4.3.3 Solving for Pitch 

From EQ 68, EQ 92 can be obtained. 

From EQ 67, EQ 93 can be obtained. 

From EQ 64, EQ 92 and EQ 93, EQ 94 can be obtained. 
1 0 From EQ 65, EQ 92 and EQ 93, EQ 95 can be obtained. 

From EQ 70, EQ 92 and EQ 93, EQ 96 can be obtained. 

From EQ 71, EQ 92 and EQ 93, EQ 97 can be obtained. 

From EQ 94, EQ 98 can be obtained. 

From EQ 95, EQ 99 can be obtained. 
1 5 From EQ 96, EQ 100 can be obtained. 

From EQ 97, EQ 101 can be obtained. 

From EQ 98 and EQ 99, EQ 102 and then EQ 103 can be obtained. 

From EQ 100 and EQ 101, EQ 104 and then EQ 105 can be obtained. 

From EQ 103 and EQ 105, EQ 106 and then EQ 107 can be obtained. 

20 EQ 107 only has a valid basis if G and H are both non-zero. Since |\|/| < 7t/2, the 

cosine (G) of the roll is always positive and hence non-zero. The sine of the roll (//) is 
only non-zero if the roll is non-zero. Specific handling for zero pitch and roll is 
described in Section 6.7.3.10. 

This therefore gives the magnitude of the sine of the pitch, since F=sin((|>), and 
25 hence the cosine (E) of the pitch by EQ 73, according to EQ 108. 

Since \§\ < 7t/2, the cosine (E) of the pitch is always positive, so there is no 
ambiguity when taking the square root. The sign of the sine (F), however, must be 
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determined by other means, as described in Section 6.7.3.9. 

Given E and F, the pitch is then obtained, according to EQ 109. 

8.4.3.4 Solving for Roll 

From EQ 103, EQ 110 can be obtained. 

5 From EQ 73, EQ 111 and then EQ 112 can be obtained. 

This therefore gives the magnitude of the sine of the roll, since #=sin(vy), and 
hence the cosine (G) of the roll by EQ 73, according to EQ 113. 

Since |\|/| < n/4, the cosine (G) of the roll is always positive, so there is no 
ambiguity when taking the square root. The sign of the sine (H), however, must be 
1 0 determined by other means, as described in Section 6.7.3.9. 

Given G and H, the roll is then obtained according to EQ 114. 

8.4.3.5 Solving for Yaw 

From EQ 73, EQ 92 and EQ 93, EQ 115 and then EQ 116 can be obtained. 

From EQ 92 and EQ 116, EQ 117 and then EQ 118 can be obtained. 

1 5 From EQ 92 and EQ 116, EQ 119 and then EQ 120 can be obtained. 

In EQ 116, and hence EQ 118 and EQ 120, the sign of the square root is 
determined by the sign of /, which can be determined from EQ 80, giving EQ 121. 

Since / (t z ) is negative, J (MX) is positive, and IJ < -1 (because \t z \ > X), then 
EQ 122 holds. 

20 Given C and D, the yaw is then obtained according to EQ 123. 

8.4.3.6 Solving for Viewport Scale 

The cosine (C) and sine (D) of the yaw are by definition never simultaneously 

zero. Since the cosine (£) of the pitch is never zero, either EQ 67 or EQ 68 can therefore 
always be used to determine the viewport scale (S). 

25 If D is non-zero, then from EQ 67, EQ 124 can be obtained. 

Otherwise, if C is non-zero, then from EQ 68, EQ 125 can be obtained. 
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8.4.3.7 Solving for Focal Length 

Similarly, since the cosine (G) of the roll is never zero, either EQ 70 or EQ 71 

can be used to determine the inverse focal length (J), so long as either the pitch or roll is 
non-zero. However, the signs of the sines (F and H) of the pitch and roll may not be 
5 known. However, the sign of the product (FH) of the sines of the pitch and roll is given 
by EQ 103, as shown in EQ 126. 

The sign can be assigned arbitrarily to F, since the sign of J is known a priori. 
If gi is non-zero, then from EQ 70, EQ 127 can be obtained. 

If hi is non-zero, then from EQ 71, EQ 128 can be obtained. 

10 In practice, the choice between using EQ 127 and EQ 128 is based on which of 

gi and hi has the larger magnitude. The inverse focal length is unknown if gi and hi are 
both zero, i.e. if the pitch and roll are both zero. 

8.4.3.8 Solving for Z Offset 

Once the inverse focal length (J) is known, the z offset (I) is obtained from EQ 

1 5 80, according to EQ 129. 

Again, the z offset (7) is unknown if the inverse focal length (J) is unknown, i.e. 
if the pitch and roll are both zero. 

8.4.3.9 Determining Direction of Pitch and Roll 

The sign of the product (FH) of the sines of the pitch and roll is given by EQ 

20 126. Since -n/4 < \\f < n/4, a roll adjustment of +7i/4 can be introduced to ensure the roll 
is always positive, without invalidating any other assumptions. Once the roll adjustment 
is introduced, EQ 126 gives the sign of the sine (F) of the pitch alone. 

The roll adjustment is introduced as follows. The viewport scale (5), inverse 
focal length (J), and z offset (I) are all computed as described. A 3D transform matrix is 
25 created from the 2D perspective transform matrix. The inverses of the viewport scale, 
focal length projection and z translation are applied to the 3D matrix in reverse order. 
The roll adjustment is then applied by pre-multiplying the matrix by a n/4 y rotation 
matrix. The roll, pitch and yaw are computed as described. Since the roll is positive, the 
pitch direction is now known. The n/4 roll adjustment is finally subtracted from the roll 
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to give the actual roll. 

When the roll and pitch are both zero, the focal length and z offset are both 
unknown as described above. However, in this case there is no need to adjust the roll 
since the pitch and roll are already known. 

5 8.4.3.10 Handling Zero Pitch and Roll 

When either the pitch or roll is zero, the general solution based on EQ 107 

becomes invalid. The table of Figure 85 shows the 12 degenerate forms of EQ 64 
through EQ 71 which result when the yaw is variously zero (or 7i), n/2 (or 37i/2), and 
non-zero, and the pitch and roll are variously zero and non-zero. The table of Figures 86 
1 0 and 87 sets out the required logic for detecting and handling cases where the pitch and/or 
roll are zero, with each case motivated by zeros appearing in the table of Figure 85. The 
cases in the table of Figure 85 are labelled with the case numbers from the table of 
Figures 86 and 87. 

Conclusion 

15 The present invention has been described with reference to a preferred 

embodiment and number of specific alternative embodiments. However, it will be 
appreciated by those skilled in the relevant fields that a number of other embodiments, 
differing from those specifically described, will also fall within the spirit and scope of 
the present invention. Accordingly, it will be understood that the invention is not 

20 intended to be limited to the specific embodiments described in the present specification, 
including documents incorporated by cross-reference as appropriate. The scope of the 
invention is only limited by the attached claims. 
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